{
 "cells": [
  {
   "cell_type": "markdown",
   "id": "5ce90cbc",
   "metadata": {},
   "source": [
    "* 城市：'全国'、 '北京、 '上海'、 '天津'、 '重庆'、 '广州'、'深圳'、 '苏州'、 '南京'、 '杭州'、 '大连'、'成都'、 '武汉'、 '西安'\n",
    "* 关键词：不限制\n",
    "* 工作经验：'应届生'、 '实习生'、 '1年以内'、 '1-3年'、'3-5年'、 '5-10年'、'10年以上'\n",
    "* 公司规模：1-49人、50-99人、100-499人、500-999人、1000-2000人、2000-5000人、5000-10000人、10000人以上"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "id": "6ac5b3c7",
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "请输入城市：深圳\n",
      "请输入职位：产品经理\n",
      "这是第1页，接下来将先等待7秒...然后以继续抓取\n",
      "这是第2页，接下来将先等待7秒...然后以继续抓取\n",
      "这是第3页，接下来将先等待7秒...然后以继续抓取\n",
      "这是第4页，接下来将先等待7秒...然后以继续抓取\n",
      "这是第5页，接下来将先等待9秒...然后以继续抓取\n",
      "这是第6页，接下来将先等待7秒...然后以继续抓取\n",
      "这是第7页，接下来将先等待4秒...然后以继续抓取\n",
      "这是第8页，接下来将先等待7秒...然后以继续抓取\n",
      "这是第9页，接下来将先等待9秒...然后以继续抓取\n",
      "这是第10页，接下来将先等待3秒...然后以继续抓取\n"
     ]
    },
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>dataInfo</th>\n",
       "      <th>dataParams</th>\n",
       "      <th>job.labels</th>\n",
       "      <th>job.requireWorkYears</th>\n",
       "      <th>job.requireEduLevel</th>\n",
       "      <th>job.dq</th>\n",
       "      <th>job.jobKind</th>\n",
       "      <th>job.jobId</th>\n",
       "      <th>job.refreshTime</th>\n",
       "      <th>job.title</th>\n",
       "      <th>...</th>\n",
       "      <th>recruiter.imUserType</th>\n",
       "      <th>recruiter.recruiterPhoto</th>\n",
       "      <th>recruiter.recruiterId</th>\n",
       "      <th>comp.compStage</th>\n",
       "      <th>comp.compLogo</th>\n",
       "      <th>comp.compId</th>\n",
       "      <th>comp.compScale</th>\n",
       "      <th>comp.compName</th>\n",
       "      <th>comp.compIndustry</th>\n",
       "      <th>comp.link</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>%7B%22sfrom%22%3A%22search_job_pc%22%2C%22jobK...</td>\n",
       "      <td>{\"recruiterName\":\"王女士\",\"jobKind\":\"2\",\"jobId\":\"...</td>\n",
       "      <td>[电子/半导体/集成电路, 产品规划, 需求分析, 用户研究, 市场调研, 数据分析, 存储...</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>统招本科</td>\n",
       "      <td>深圳-南山区</td>\n",
       "      <td>2</td>\n",
       "      <td>58727339</td>\n",
       "      <td>20231103092225</td>\n",
       "      <td>PM产品经理</td>\n",
       "      <td>...</td>\n",
       "      <td>2</td>\n",
       "      <td>64edb1871913aa27489a503a05u.png</td>\n",
       "      <td>ccdad52c2bbc660a07efab5c7119dcac</td>\n",
       "      <td>A轮</td>\n",
       "      <td>62f463c85732892a6615046607u.png</td>\n",
       "      <td>13146105.0</td>\n",
       "      <td>50-99人</td>\n",
       "      <td>联和存储科技(江苏)有限公司</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>https://www.liepin.com/company/13146105/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>%7B%22sfrom%22%3A%22search_job_pc%22%2C%22jobK...</td>\n",
       "      <td>{\"recruiterName\":\"丘女士\",\"jobKind\":\"2\",\"jobId\":\"...</td>\n",
       "      <td>[产品经理, 食品产品经理]</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>深圳-宝安区</td>\n",
       "      <td>2</td>\n",
       "      <td>62599965</td>\n",
       "      <td>20231102203839</td>\n",
       "      <td>产品经理 (MJ000380)</td>\n",
       "      <td>...</td>\n",
       "      <td>2</td>\n",
       "      <td>60d15c9a25b2480b7839087707u.png</td>\n",
       "      <td>84154b1181cb0bd149549ef892d00baf</td>\n",
       "      <td>NaN</td>\n",
       "      <td>5bfe9ab674719df0ac99e25b03a.png</td>\n",
       "      <td>7286569.0</td>\n",
       "      <td>2000-5000人</td>\n",
       "      <td>卡士酸奶</td>\n",
       "      <td>食品/饮料/酒水</td>\n",
       "      <td>https://www.liepin.com/company/7286569/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>%7B%22sfrom%22%3A%22search_job_pc%22%2C%22jobK...</td>\n",
       "      <td>{\"recruiterName\":\"郑女士\",\"jobKind\":\"2\",\"jobId\":\"...</td>\n",
       "      <td>[平台产品, 竞品分析, 策略产品, 商业产品, 产品优化, 产品规划, 产品设计, 医疗行...</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>深圳-光明区</td>\n",
       "      <td>2</td>\n",
       "      <td>62389205</td>\n",
       "      <td>20231023105635</td>\n",
       "      <td>高级产品经理</td>\n",
       "      <td>...</td>\n",
       "      <td>2</td>\n",
       "      <td>5f8f9865ea60860b75384fa508u.jpg</td>\n",
       "      <td>4b5b2f7a74ebead3aadebbe992efa486</td>\n",
       "      <td>天使轮</td>\n",
       "      <td>636488445fd2f55ef5a3291b07u.png</td>\n",
       "      <td>12787419.0</td>\n",
       "      <td>100-499人</td>\n",
       "      <td>倍生生物科技(深圳)有限公司</td>\n",
       "      <td>生物技术</td>\n",
       "      <td>https://www.liepin.com/company/12787419/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>%7B%22sfrom%22%3A%22search_job_pc%22%2C%22jobK...</td>\n",
       "      <td>{\"recruiterName\":\"陈先生\",\"jobKind\":\"2\",\"jobId\":\"...</td>\n",
       "      <td>[数据产品, 数据挖掘, 数据分析, 产品规划]</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>学历不限</td>\n",
       "      <td>深圳</td>\n",
       "      <td>2</td>\n",
       "      <td>36908035</td>\n",
       "      <td>20210311143004</td>\n",
       "      <td>Lazada-Data Product Manager-流量数据产品经理（深圳）</td>\n",
       "      <td>...</td>\n",
       "      <td>2</td>\n",
       "      <td>57c4e9d345ce9f0a74a9b8cd04a.png</td>\n",
       "      <td>d4be0c7af21e622076f4668dbe33f725</td>\n",
       "      <td>港股上市</td>\n",
       "      <td>5bfe915474719d35745acf0903a.png</td>\n",
       "      <td>1072424.0</td>\n",
       "      <td>10000人以上</td>\n",
       "      <td>阿里巴巴集团</td>\n",
       "      <td>互联网</td>\n",
       "      <td>https://www.liepin.com/company/1072424/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>%7B%22sfrom%22%3A%22search_job_pc%22%2C%22jobK...</td>\n",
       "      <td>{\"recruiterName\":\"郑女士\",\"jobKind\":\"2\",\"jobId\":\"...</td>\n",
       "      <td>[数据产品, 供应链产品, 电子/半导体/集成电路, 电子商务, 产品规划, 用户研究, 市...</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>深圳</td>\n",
       "      <td>2</td>\n",
       "      <td>44250585</td>\n",
       "      <td>20231016141158</td>\n",
       "      <td>产品经理 PM（NXP）</td>\n",
       "      <td>...</td>\n",
       "      <td>2</td>\n",
       "      <td>5f8f9866dfb13a7dee342f1808u.jpg</td>\n",
       "      <td>03a35b280d00a67bde30689f01e9fe19</td>\n",
       "      <td>NaN</td>\n",
       "      <td>5bfe9bce74719df0ac9a179703a.png</td>\n",
       "      <td>8523053.0</td>\n",
       "      <td>500-999人</td>\n",
       "      <td>中电港</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>https://www.liepin.com/company/8523053/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>35</th>\n",
       "      <td>%7B%22jobKind%22%3A%221%22%2C%22d_curPage%22%3...</td>\n",
       "      <td>{\"jobKind\":\"1\",\"imId\":\"ba0a96f6ae59f7bc78f5885...</td>\n",
       "      <td>[硬件产品, 定价/竞价产品, 策略产品, 硬件系统集成, 智能硬件, 产品规划, 产品设计...</td>\n",
       "      <td>1-3年</td>\n",
       "      <td>本科</td>\n",
       "      <td>深圳-宝安区</td>\n",
       "      <td>1</td>\n",
       "      <td>48830049</td>\n",
       "      <td>20230823112426</td>\n",
       "      <td>空气净化器产品经理</td>\n",
       "      <td>...</td>\n",
       "      <td>2</td>\n",
       "      <td>5ed07edb15b5366045ca70bf06u.jpg</td>\n",
       "      <td>8e77aa8e53cf6a9405ab705a19c04428</td>\n",
       "      <td>A轮</td>\n",
       "      <td>60e2fe0bf3df194a3c48adb502u.png</td>\n",
       "      <td>NaN</td>\n",
       "      <td>100-499人</td>\n",
       "      <td>某北京电子商务公司</td>\n",
       "      <td>电子商务</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>36</th>\n",
       "      <td>%7B%22jobKind%22%3A%222%22%2C%22d_curPage%22%3...</td>\n",
       "      <td>{\"jobKind\":\"2\",\"imId\":\"720f553e8eb55bea69df241...</td>\n",
       "      <td>[供应链产品, 数据产品, AI人工智能产品, 物流快递, 交通/运输/物流, IT互联网,...</td>\n",
       "      <td>3-5年</td>\n",
       "      <td>本科</td>\n",
       "      <td>深圳-南山区</td>\n",
       "      <td>2</td>\n",
       "      <td>58191709</td>\n",
       "      <td>20230427154916</td>\n",
       "      <td>JTY-产品经理</td>\n",
       "      <td>...</td>\n",
       "      <td>2</td>\n",
       "      <td>5f8f98698dbe6273dcf8516008u.jpg</td>\n",
       "      <td>81619137940fa509af010da14d9775a4</td>\n",
       "      <td>NaN</td>\n",
       "      <td>5bfe980c74719df0ac995aad03a.png</td>\n",
       "      <td>8353564.0</td>\n",
       "      <td>2000-5000人</td>\n",
       "      <td>顺丰科技</td>\n",
       "      <td>互联网</td>\n",
       "      <td>https://www.liepin.com/company/8353564/</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>37</th>\n",
       "      <td>%7B%22jobKind%22%3A%221%22%2C%22d_curPage%22%3...</td>\n",
       "      <td>{\"jobKind\":\"1\",\"imId\":\"820b5299af6b1f62bb772d4...</td>\n",
       "      <td>[OA产品, OA系统, 批发/零售, 交互设计, 产品设计, 产品规划, Axure, M...</td>\n",
       "      <td>经验不限</td>\n",
       "      <td>本科</td>\n",
       "      <td>深圳-福田区</td>\n",
       "      <td>1</td>\n",
       "      <td>48773967</td>\n",
       "      <td>20230914154230</td>\n",
       "      <td>高级产品经理</td>\n",
       "      <td>...</td>\n",
       "      <td>2</td>\n",
       "      <td>60d3f0f2d2af6b6feaafcfb706u.png</td>\n",
       "      <td>90f7d3ed956e5da88d49e3349302ee7c</td>\n",
       "      <td>已上市</td>\n",
       "      <td>60e2fe0bf3df194a3c48adb502u.png</td>\n",
       "      <td>NaN</td>\n",
       "      <td>10000人以上</td>\n",
       "      <td>某知名上市公司</td>\n",
       "      <td>互联网</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>38</th>\n",
       "      <td>%7B%22jobKind%22%3A%221%22%2C%22d_curPage%22%3...</td>\n",
       "      <td>{\"jobKind\":\"1\",\"imId\":\"4e2ea5af3c8dd8fbe030375...</td>\n",
       "      <td>[交易产品, IT互联网, 证券业, 团队管理]</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>深圳-福田区</td>\n",
       "      <td>1</td>\n",
       "      <td>48620681</td>\n",
       "      <td>20230815140559</td>\n",
       "      <td>交易系统服务产品经理</td>\n",
       "      <td>...</td>\n",
       "      <td>2</td>\n",
       "      <td>64743a0fe549742f9ba314b405u.png</td>\n",
       "      <td>ea024ead13082e1ebe9353fa8b0496c7</td>\n",
       "      <td>融资未公开</td>\n",
       "      <td>60e2fe0bf3df194a3c48adb502u.png</td>\n",
       "      <td>NaN</td>\n",
       "      <td>10000人以上</td>\n",
       "      <td>某知名公司</td>\n",
       "      <td>电子商务</td>\n",
       "      <td></td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>39</th>\n",
       "      <td>%7B%22jobKind%22%3A%222%22%2C%22d_curPage%22%3...</td>\n",
       "      <td>{\"jobKind\":\"2\",\"imId\":\"2b172403a889c4b8befa578...</td>\n",
       "      <td>[支付产品, to G, to B, 消费券, 政府权益, 银行合作]</td>\n",
       "      <td>1-3年</td>\n",
       "      <td>本科</td>\n",
       "      <td>深圳</td>\n",
       "      <td>2</td>\n",
       "      <td>61037495</td>\n",
       "      <td>20230823102535</td>\n",
       "      <td>权益产品经理 (MJ001105)</td>\n",
       "      <td>...</td>\n",
       "      <td>2</td>\n",
       "      <td>5f8f9866dfb13a7dee342f1808u.jpg</td>\n",
       "      <td>a16b12ed23e0d953c0d85e41c4b7b278</td>\n",
       "      <td>创业板上市</td>\n",
       "      <td>5c88aad9509919c4f51cbe9308u.jpg</td>\n",
       "      <td>855567.0</td>\n",
       "      <td>5000-10000人</td>\n",
       "      <td>深圳市新国都股份有限公司</td>\n",
       "      <td>智能硬件</td>\n",
       "      <td>https://www.liepin.com/company/855567/</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>400 rows × 31 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "                                             dataInfo  \\\n",
       "0   %7B%22sfrom%22%3A%22search_job_pc%22%2C%22jobK...   \n",
       "1   %7B%22sfrom%22%3A%22search_job_pc%22%2C%22jobK...   \n",
       "2   %7B%22sfrom%22%3A%22search_job_pc%22%2C%22jobK...   \n",
       "3   %7B%22sfrom%22%3A%22search_job_pc%22%2C%22jobK...   \n",
       "4   %7B%22sfrom%22%3A%22search_job_pc%22%2C%22jobK...   \n",
       "..                                                ...   \n",
       "35  %7B%22jobKind%22%3A%221%22%2C%22d_curPage%22%3...   \n",
       "36  %7B%22jobKind%22%3A%222%22%2C%22d_curPage%22%3...   \n",
       "37  %7B%22jobKind%22%3A%221%22%2C%22d_curPage%22%3...   \n",
       "38  %7B%22jobKind%22%3A%221%22%2C%22d_curPage%22%3...   \n",
       "39  %7B%22jobKind%22%3A%222%22%2C%22d_curPage%22%3...   \n",
       "\n",
       "                                           dataParams  \\\n",
       "0   {\"recruiterName\":\"王女士\",\"jobKind\":\"2\",\"jobId\":\"...   \n",
       "1   {\"recruiterName\":\"丘女士\",\"jobKind\":\"2\",\"jobId\":\"...   \n",
       "2   {\"recruiterName\":\"郑女士\",\"jobKind\":\"2\",\"jobId\":\"...   \n",
       "3   {\"recruiterName\":\"陈先生\",\"jobKind\":\"2\",\"jobId\":\"...   \n",
       "4   {\"recruiterName\":\"郑女士\",\"jobKind\":\"2\",\"jobId\":\"...   \n",
       "..                                                ...   \n",
       "35  {\"jobKind\":\"1\",\"imId\":\"ba0a96f6ae59f7bc78f5885...   \n",
       "36  {\"jobKind\":\"2\",\"imId\":\"720f553e8eb55bea69df241...   \n",
       "37  {\"jobKind\":\"1\",\"imId\":\"820b5299af6b1f62bb772d4...   \n",
       "38  {\"jobKind\":\"1\",\"imId\":\"4e2ea5af3c8dd8fbe030375...   \n",
       "39  {\"jobKind\":\"2\",\"imId\":\"2b172403a889c4b8befa578...   \n",
       "\n",
       "                                           job.labels job.requireWorkYears  \\\n",
       "0   [电子/半导体/集成电路, 产品规划, 需求分析, 用户研究, 市场调研, 数据分析, 存储...                5-10年   \n",
       "1                                      [产品经理, 食品产品经理]                5-10年   \n",
       "2   [平台产品, 竞品分析, 策略产品, 商业产品, 产品优化, 产品规划, 产品设计, 医疗行...                5-10年   \n",
       "3                            [数据产品, 数据挖掘, 数据分析, 产品规划]                5-10年   \n",
       "4   [数据产品, 供应链产品, 电子/半导体/集成电路, 电子商务, 产品规划, 用户研究, 市...                5-10年   \n",
       "..                                                ...                  ...   \n",
       "35  [硬件产品, 定价/竞价产品, 策略产品, 硬件系统集成, 智能硬件, 产品规划, 产品设计...                 1-3年   \n",
       "36  [供应链产品, 数据产品, AI人工智能产品, 物流快递, 交通/运输/物流, IT互联网,...                 3-5年   \n",
       "37  [OA产品, OA系统, 批发/零售, 交互设计, 产品设计, 产品规划, Axure, M...                 经验不限   \n",
       "38                           [交易产品, IT互联网, 证券业, 团队管理]                5-10年   \n",
       "39                [支付产品, to G, to B, 消费券, 政府权益, 银行合作]                 1-3年   \n",
       "\n",
       "   job.requireEduLevel  job.dq job.jobKind job.jobId job.refreshTime  \\\n",
       "0                 统招本科  深圳-南山区           2  58727339  20231103092225   \n",
       "1                   本科  深圳-宝安区           2  62599965  20231102203839   \n",
       "2                   本科  深圳-光明区           2  62389205  20231023105635   \n",
       "3                 学历不限      深圳           2  36908035  20210311143004   \n",
       "4                   本科      深圳           2  44250585  20231016141158   \n",
       "..                 ...     ...         ...       ...             ...   \n",
       "35                  本科  深圳-宝安区           1  48830049  20230823112426   \n",
       "36                  本科  深圳-南山区           2  58191709  20230427154916   \n",
       "37                  本科  深圳-福田区           1  48773967  20230914154230   \n",
       "38                  本科  深圳-福田区           1  48620681  20230815140559   \n",
       "39                  本科      深圳           2  61037495  20230823102535   \n",
       "\n",
       "                                   job.title  ...  recruiter.imUserType  \\\n",
       "0                                     PM产品经理  ...                     2   \n",
       "1                            产品经理 (MJ000380)  ...                     2   \n",
       "2                                     高级产品经理  ...                     2   \n",
       "3   Lazada-Data Product Manager-流量数据产品经理（深圳）  ...                     2   \n",
       "4                               产品经理 PM（NXP）  ...                     2   \n",
       "..                                       ...  ...                   ...   \n",
       "35                                 空气净化器产品经理  ...                     2   \n",
       "36                                  JTY-产品经理  ...                     2   \n",
       "37                                    高级产品经理  ...                     2   \n",
       "38                                交易系统服务产品经理  ...                     2   \n",
       "39                         权益产品经理 (MJ001105)  ...                     2   \n",
       "\n",
       "           recruiter.recruiterPhoto             recruiter.recruiterId  \\\n",
       "0   64edb1871913aa27489a503a05u.png  ccdad52c2bbc660a07efab5c7119dcac   \n",
       "1   60d15c9a25b2480b7839087707u.png  84154b1181cb0bd149549ef892d00baf   \n",
       "2   5f8f9865ea60860b75384fa508u.jpg  4b5b2f7a74ebead3aadebbe992efa486   \n",
       "3   57c4e9d345ce9f0a74a9b8cd04a.png  d4be0c7af21e622076f4668dbe33f725   \n",
       "4   5f8f9866dfb13a7dee342f1808u.jpg  03a35b280d00a67bde30689f01e9fe19   \n",
       "..                              ...                               ...   \n",
       "35  5ed07edb15b5366045ca70bf06u.jpg  8e77aa8e53cf6a9405ab705a19c04428   \n",
       "36  5f8f98698dbe6273dcf8516008u.jpg  81619137940fa509af010da14d9775a4   \n",
       "37  60d3f0f2d2af6b6feaafcfb706u.png  90f7d3ed956e5da88d49e3349302ee7c   \n",
       "38  64743a0fe549742f9ba314b405u.png  ea024ead13082e1ebe9353fa8b0496c7   \n",
       "39  5f8f9866dfb13a7dee342f1808u.jpg  a16b12ed23e0d953c0d85e41c4b7b278   \n",
       "\n",
       "   comp.compStage                    comp.compLogo comp.compId comp.compScale  \\\n",
       "0              A轮  62f463c85732892a6615046607u.png  13146105.0         50-99人   \n",
       "1             NaN  5bfe9ab674719df0ac99e25b03a.png   7286569.0     2000-5000人   \n",
       "2             天使轮  636488445fd2f55ef5a3291b07u.png  12787419.0       100-499人   \n",
       "3            港股上市  5bfe915474719d35745acf0903a.png   1072424.0       10000人以上   \n",
       "4             NaN  5bfe9bce74719df0ac9a179703a.png   8523053.0       500-999人   \n",
       "..            ...                              ...         ...            ...   \n",
       "35             A轮  60e2fe0bf3df194a3c48adb502u.png         NaN       100-499人   \n",
       "36            NaN  5bfe980c74719df0ac995aad03a.png   8353564.0     2000-5000人   \n",
       "37            已上市  60e2fe0bf3df194a3c48adb502u.png         NaN       10000人以上   \n",
       "38          融资未公开  60e2fe0bf3df194a3c48adb502u.png         NaN       10000人以上   \n",
       "39          创业板上市  5c88aad9509919c4f51cbe9308u.jpg    855567.0    5000-10000人   \n",
       "\n",
       "     comp.compName comp.compIndustry                                 comp.link  \n",
       "0   联和存储科技(江苏)有限公司       电子/半导体/集成电路  https://www.liepin.com/company/13146105/  \n",
       "1             卡士酸奶          食品/饮料/酒水   https://www.liepin.com/company/7286569/  \n",
       "2   倍生生物科技(深圳)有限公司              生物技术  https://www.liepin.com/company/12787419/  \n",
       "3           阿里巴巴集团               互联网   https://www.liepin.com/company/1072424/  \n",
       "4              中电港       电子/半导体/集成电路   https://www.liepin.com/company/8523053/  \n",
       "..             ...               ...                                       ...  \n",
       "35       某北京电子商务公司              电子商务                                            \n",
       "36            顺丰科技               互联网   https://www.liepin.com/company/8353564/  \n",
       "37         某知名上市公司               互联网                                            \n",
       "38           某知名公司              电子商务                                            \n",
       "39    深圳市新国都股份有限公司              智能硬件    https://www.liepin.com/company/855567/  \n",
       "\n",
       "[400 rows x 31 columns]"
      ]
     },
     "execution_count": 1,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "import crawl_liepin\n",
    "城市 = input(\"请输入城市：\")\n",
    "关键词 = input(\"请输入职位：\")\n",
    "af= crawl_liepin.crawl(城市=城市,关键词=关键词)\n",
    "af"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "id": "f03f2340",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "Index(['dataInfo', 'dataParams', 'job.labels', 'job.requireWorkYears',\n",
       "       'job.requireEduLevel', 'job.dq', 'job.jobKind', 'job.jobId',\n",
       "       'job.refreshTime', 'job.title', 'job.topJob', 'job.dataPromId',\n",
       "       'job.advViewFlag', 'job.link', 'job.salary', 'job.pcOuterLink',\n",
       "       'job.h5OuterLink', 'recruiter.recruiterName',\n",
       "       'recruiter.recruiterTitle', 'recruiter.chatted', 'recruiter.imId',\n",
       "       'recruiter.imUserType', 'recruiter.recruiterPhoto',\n",
       "       'recruiter.recruiterId', 'comp.compStage', 'comp.compLogo',\n",
       "       'comp.compId', 'comp.compScale', 'comp.compName', 'comp.compIndustry',\n",
       "       'comp.link'],\n",
       "      dtype='object')"
      ]
     },
     "execution_count": 2,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "af.columns"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "de174a13",
   "metadata": {},
   "source": [
    "# 可分析的列\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "id": "2f20cb90",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>job.labels</th>\n",
       "      <th>job.refreshTime</th>\n",
       "      <th>job.title</th>\n",
       "      <th>job.salary</th>\n",
       "      <th>job.dq</th>\n",
       "      <th>job.topJob</th>\n",
       "      <th>job.requireWorkYears</th>\n",
       "      <th>job.requireEduLevel</th>\n",
       "      <th>comp.compStage</th>\n",
       "      <th>comp.compName</th>\n",
       "      <th>comp.compIndustry</th>\n",
       "      <th>comp.compScale</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>[电子/半导体/集成电路, 产品规划, 需求分析, 用户研究, 市场调研, 数据分析, 存储...</td>\n",
       "      <td>20231103092225</td>\n",
       "      <td>PM产品经理</td>\n",
       "      <td>20-40k·14薪</td>\n",
       "      <td>深圳-南山区</td>\n",
       "      <td>True</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>统招本科</td>\n",
       "      <td>A轮</td>\n",
       "      <td>联和存储科技(江苏)有限公司</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>50-99人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>[产品经理, 食品产品经理]</td>\n",
       "      <td>20231102203839</td>\n",
       "      <td>产品经理 (MJ000380)</td>\n",
       "      <td>薪资面议</td>\n",
       "      <td>深圳-宝安区</td>\n",
       "      <td>True</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>NaN</td>\n",
       "      <td>卡士酸奶</td>\n",
       "      <td>食品/饮料/酒水</td>\n",
       "      <td>2000-5000人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>[平台产品, 竞品分析, 策略产品, 商业产品, 产品优化, 产品规划, 产品设计, 医疗行...</td>\n",
       "      <td>20231023105635</td>\n",
       "      <td>高级产品经理</td>\n",
       "      <td>20-30k</td>\n",
       "      <td>深圳-光明区</td>\n",
       "      <td>True</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>天使轮</td>\n",
       "      <td>倍生生物科技(深圳)有限公司</td>\n",
       "      <td>生物技术</td>\n",
       "      <td>100-499人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>[数据产品, 数据挖掘, 数据分析, 产品规划]</td>\n",
       "      <td>20210311143004</td>\n",
       "      <td>Lazada-Data Product Manager-流量数据产品经理（深圳）</td>\n",
       "      <td>薪资面议</td>\n",
       "      <td>深圳</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>学历不限</td>\n",
       "      <td>港股上市</td>\n",
       "      <td>阿里巴巴集团</td>\n",
       "      <td>互联网</td>\n",
       "      <td>10000人以上</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>[数据产品, 供应链产品, 电子/半导体/集成电路, 电子商务, 产品规划, 用户研究, 市...</td>\n",
       "      <td>20231016141158</td>\n",
       "      <td>产品经理 PM（NXP）</td>\n",
       "      <td>15-20k·15薪</td>\n",
       "      <td>深圳</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>NaN</td>\n",
       "      <td>中电港</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>500-999人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>35</th>\n",
       "      <td>[硬件产品, 定价/竞价产品, 策略产品, 硬件系统集成, 智能硬件, 产品规划, 产品设计...</td>\n",
       "      <td>20230823112426</td>\n",
       "      <td>空气净化器产品经理</td>\n",
       "      <td>12-18k</td>\n",
       "      <td>深圳-宝安区</td>\n",
       "      <td>False</td>\n",
       "      <td>1-3年</td>\n",
       "      <td>本科</td>\n",
       "      <td>A轮</td>\n",
       "      <td>某北京电子商务公司</td>\n",
       "      <td>电子商务</td>\n",
       "      <td>100-499人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>36</th>\n",
       "      <td>[供应链产品, 数据产品, AI人工智能产品, 物流快递, 交通/运输/物流, IT互联网,...</td>\n",
       "      <td>20230427154916</td>\n",
       "      <td>JTY-产品经理</td>\n",
       "      <td>15-25k·14薪</td>\n",
       "      <td>深圳-南山区</td>\n",
       "      <td>False</td>\n",
       "      <td>3-5年</td>\n",
       "      <td>本科</td>\n",
       "      <td>NaN</td>\n",
       "      <td>顺丰科技</td>\n",
       "      <td>互联网</td>\n",
       "      <td>2000-5000人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>37</th>\n",
       "      <td>[OA产品, OA系统, 批发/零售, 交互设计, 产品设计, 产品规划, Axure, M...</td>\n",
       "      <td>20230914154230</td>\n",
       "      <td>高级产品经理</td>\n",
       "      <td>30-40k·14薪</td>\n",
       "      <td>深圳-福田区</td>\n",
       "      <td>False</td>\n",
       "      <td>经验不限</td>\n",
       "      <td>本科</td>\n",
       "      <td>已上市</td>\n",
       "      <td>某知名上市公司</td>\n",
       "      <td>互联网</td>\n",
       "      <td>10000人以上</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>38</th>\n",
       "      <td>[交易产品, IT互联网, 证券业, 团队管理]</td>\n",
       "      <td>20230815140559</td>\n",
       "      <td>交易系统服务产品经理</td>\n",
       "      <td>50-65k·18薪</td>\n",
       "      <td>深圳-福田区</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>融资未公开</td>\n",
       "      <td>某知名公司</td>\n",
       "      <td>电子商务</td>\n",
       "      <td>10000人以上</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>39</th>\n",
       "      <td>[支付产品, to G, to B, 消费券, 政府权益, 银行合作]</td>\n",
       "      <td>20230823102535</td>\n",
       "      <td>权益产品经理 (MJ001105)</td>\n",
       "      <td>14-23k</td>\n",
       "      <td>深圳</td>\n",
       "      <td>False</td>\n",
       "      <td>1-3年</td>\n",
       "      <td>本科</td>\n",
       "      <td>创业板上市</td>\n",
       "      <td>深圳市新国都股份有限公司</td>\n",
       "      <td>智能硬件</td>\n",
       "      <td>5000-10000人</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>400 rows × 12 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "                                           job.labels job.refreshTime  \\\n",
       "0   [电子/半导体/集成电路, 产品规划, 需求分析, 用户研究, 市场调研, 数据分析, 存储...  20231103092225   \n",
       "1                                      [产品经理, 食品产品经理]  20231102203839   \n",
       "2   [平台产品, 竞品分析, 策略产品, 商业产品, 产品优化, 产品规划, 产品设计, 医疗行...  20231023105635   \n",
       "3                            [数据产品, 数据挖掘, 数据分析, 产品规划]  20210311143004   \n",
       "4   [数据产品, 供应链产品, 电子/半导体/集成电路, 电子商务, 产品规划, 用户研究, 市...  20231016141158   \n",
       "..                                                ...             ...   \n",
       "35  [硬件产品, 定价/竞价产品, 策略产品, 硬件系统集成, 智能硬件, 产品规划, 产品设计...  20230823112426   \n",
       "36  [供应链产品, 数据产品, AI人工智能产品, 物流快递, 交通/运输/物流, IT互联网,...  20230427154916   \n",
       "37  [OA产品, OA系统, 批发/零售, 交互设计, 产品设计, 产品规划, Axure, M...  20230914154230   \n",
       "38                           [交易产品, IT互联网, 证券业, 团队管理]  20230815140559   \n",
       "39                [支付产品, to G, to B, 消费券, 政府权益, 银行合作]  20230823102535   \n",
       "\n",
       "                                   job.title  job.salary  job.dq  job.topJob  \\\n",
       "0                                     PM产品经理  20-40k·14薪  深圳-南山区        True   \n",
       "1                            产品经理 (MJ000380)        薪资面议  深圳-宝安区        True   \n",
       "2                                     高级产品经理      20-30k  深圳-光明区        True   \n",
       "3   Lazada-Data Product Manager-流量数据产品经理（深圳）        薪资面议      深圳       False   \n",
       "4                               产品经理 PM（NXP）  15-20k·15薪      深圳       False   \n",
       "..                                       ...         ...     ...         ...   \n",
       "35                                 空气净化器产品经理      12-18k  深圳-宝安区       False   \n",
       "36                                  JTY-产品经理  15-25k·14薪  深圳-南山区       False   \n",
       "37                                    高级产品经理  30-40k·14薪  深圳-福田区       False   \n",
       "38                                交易系统服务产品经理  50-65k·18薪  深圳-福田区       False   \n",
       "39                         权益产品经理 (MJ001105)      14-23k      深圳       False   \n",
       "\n",
       "   job.requireWorkYears job.requireEduLevel comp.compStage   comp.compName  \\\n",
       "0                 5-10年                统招本科             A轮  联和存储科技(江苏)有限公司   \n",
       "1                 5-10年                  本科            NaN            卡士酸奶   \n",
       "2                 5-10年                  本科            天使轮  倍生生物科技(深圳)有限公司   \n",
       "3                 5-10年                学历不限           港股上市          阿里巴巴集团   \n",
       "4                 5-10年                  本科            NaN             中电港   \n",
       "..                  ...                 ...            ...             ...   \n",
       "35                 1-3年                  本科             A轮       某北京电子商务公司   \n",
       "36                 3-5年                  本科            NaN            顺丰科技   \n",
       "37                 经验不限                  本科            已上市         某知名上市公司   \n",
       "38                5-10年                  本科          融资未公开           某知名公司   \n",
       "39                 1-3年                  本科          创业板上市    深圳市新国都股份有限公司   \n",
       "\n",
       "   comp.compIndustry comp.compScale  \n",
       "0        电子/半导体/集成电路         50-99人  \n",
       "1           食品/饮料/酒水     2000-5000人  \n",
       "2               生物技术       100-499人  \n",
       "3                互联网       10000人以上  \n",
       "4        电子/半导体/集成电路       500-999人  \n",
       "..               ...            ...  \n",
       "35              电子商务       100-499人  \n",
       "36               互联网     2000-5000人  \n",
       "37               互联网       10000人以上  \n",
       "38              电子商务       10000人以上  \n",
       "39              智能硬件    5000-10000人  \n",
       "\n",
       "[400 rows x 12 columns]"
      ]
     },
     "execution_count": 3,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df_PM_gz =  af[['job.labels','job.refreshTime','job.title','job.salary','job.dq','job.topJob','job.requireWorkYears','job.requireEduLevel','comp.compStage','comp.compName','comp.compIndustry','comp.compScale']]\n",
    "df_PM_gz"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "id": "1fe57c80",
   "metadata": {},
   "outputs": [],
   "source": [
    "from pyecharts import options as opts\n",
    "from pyecharts.charts import Bar, Grid, Line,Map,Page,WordCloud\n",
    "from pyecharts.faker import Faker"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "b3c67b16",
   "metadata": {},
   "source": [
    "# 地图"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "id": "28918200",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "深圳-南山区    119\n",
       "深圳        119\n",
       "深圳-福田区     57\n",
       "深圳-宝安区     35\n",
       "深圳-龙华区     33\n",
       "深圳-龙岗区     21\n",
       "深圳-光明区      5\n",
       "深圳-罗湖区      4\n",
       "深圳-盐田区      4\n",
       "深圳-坪山区      3\n",
       "Name: job.dq, dtype: int64"
      ]
     },
     "execution_count": 5,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df_PM_gz['job.dq'].value_counts()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 6,
   "id": "10c0b3e9",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "['深圳-南山区',\n",
       " '深圳-福田区',\n",
       " '深圳-宝安区',\n",
       " '深圳-龙华区',\n",
       " '深圳-龙岗区',\n",
       " '深圳-光明区',\n",
       " '深圳-罗湖区',\n",
       " '深圳-盐田区',\n",
       " '深圳-坪山区']"
      ]
     },
     "execution_count": 6,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "[i for i in df_PM_gz['job.dq'].value_counts().index.tolist() if '-' in i]"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 7,
   "id": "deb28e4b",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "['南山区', '福田区', '宝安区', '龙华区', '龙岗区', '光明区', '罗湖区', '盐田区', '坪山区']"
      ]
     },
     "execution_count": 7,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "地区 = [ df_PM_gz['job.dq'].value_counts().index.tolist()[i].split('-')[1]\\\n",
    "     for i, v in enumerate(df_PM_gz['job.dq'].value_counts().index.tolist()) if '-' in v]\n",
    "地区"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 8,
   "id": "e6b873f2",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "[119, 57, 35, 33, 21, 5, 4, 4, 3]"
      ]
     },
     "execution_count": 8,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "岗位个数 =[ df_PM_gz['job.dq'].value_counts().values.tolist()[i]   for i,v in enumerate(df_PM_gz['job.dq'].value_counts().index.tolist()) if '-' in v]\n",
    "岗位个数"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "f64354f9",
   "metadata": {},
   "outputs": [],
   "source": [
    "\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 9,
   "id": "cc93eaec",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>job.labels</th>\n",
       "      <th>job.refreshTime</th>\n",
       "      <th>job.title</th>\n",
       "      <th>job.salary</th>\n",
       "      <th>job.dq</th>\n",
       "      <th>job.topJob</th>\n",
       "      <th>job.requireWorkYears</th>\n",
       "      <th>job.requireEduLevel</th>\n",
       "      <th>comp.compStage</th>\n",
       "      <th>comp.compName</th>\n",
       "      <th>comp.compIndustry</th>\n",
       "      <th>comp.compScale</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>[电子/半导体/集成电路, 产品规划, 需求分析, 用户研究, 市场调研, 数据分析, 存储...</td>\n",
       "      <td>20231103092225</td>\n",
       "      <td>PM产品经理</td>\n",
       "      <td>20-40k·14薪</td>\n",
       "      <td>深圳-南山区</td>\n",
       "      <td>True</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>统招本科</td>\n",
       "      <td>A轮</td>\n",
       "      <td>联和存储科技(江苏)有限公司</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>50-99人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1</th>\n",
       "      <td>[产品经理, 食品产品经理]</td>\n",
       "      <td>20231102203839</td>\n",
       "      <td>产品经理 (MJ000380)</td>\n",
       "      <td>薪资面议</td>\n",
       "      <td>深圳-宝安区</td>\n",
       "      <td>True</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>NaN</td>\n",
       "      <td>卡士酸奶</td>\n",
       "      <td>食品/饮料/酒水</td>\n",
       "      <td>2000-5000人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>[平台产品, 竞品分析, 策略产品, 商业产品, 产品优化, 产品规划, 产品设计, 医疗行...</td>\n",
       "      <td>20231023105635</td>\n",
       "      <td>高级产品经理</td>\n",
       "      <td>20-30k</td>\n",
       "      <td>深圳-光明区</td>\n",
       "      <td>True</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>天使轮</td>\n",
       "      <td>倍生生物科技(深圳)有限公司</td>\n",
       "      <td>生物技术</td>\n",
       "      <td>100-499人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3</th>\n",
       "      <td>[数据产品, 数据挖掘, 数据分析, 产品规划]</td>\n",
       "      <td>20210311143004</td>\n",
       "      <td>Lazada-Data Product Manager-流量数据产品经理（深圳）</td>\n",
       "      <td>薪资面议</td>\n",
       "      <td>深圳</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>学历不限</td>\n",
       "      <td>港股上市</td>\n",
       "      <td>阿里巴巴集团</td>\n",
       "      <td>互联网</td>\n",
       "      <td>10000人以上</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>[数据产品, 供应链产品, 电子/半导体/集成电路, 电子商务, 产品规划, 用户研究, 市...</td>\n",
       "      <td>20231016141158</td>\n",
       "      <td>产品经理 PM（NXP）</td>\n",
       "      <td>15-20k·15薪</td>\n",
       "      <td>深圳</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>NaN</td>\n",
       "      <td>中电港</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>500-999人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>34</th>\n",
       "      <td>[OA产品]</td>\n",
       "      <td>20230907193745</td>\n",
       "      <td>高级产品经理（OA系统）</td>\n",
       "      <td>30-45k·15薪</td>\n",
       "      <td>深圳-福田区</td>\n",
       "      <td>False</td>\n",
       "      <td>10年以上</td>\n",
       "      <td>学历不限</td>\n",
       "      <td>不需要融资</td>\n",
       "      <td>某深圳大型批发/零售公司</td>\n",
       "      <td>批发/零售</td>\n",
       "      <td>10000人以上</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>35</th>\n",
       "      <td>[硬件产品, 定价/竞价产品, 策略产品, 硬件系统集成, 智能硬件, 产品规划, 产品设计...</td>\n",
       "      <td>20230823112426</td>\n",
       "      <td>空气净化器产品经理</td>\n",
       "      <td>12-18k</td>\n",
       "      <td>深圳-宝安区</td>\n",
       "      <td>False</td>\n",
       "      <td>1-3年</td>\n",
       "      <td>本科</td>\n",
       "      <td>A轮</td>\n",
       "      <td>某北京电子商务公司</td>\n",
       "      <td>电子商务</td>\n",
       "      <td>100-499人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>36</th>\n",
       "      <td>[供应链产品, 数据产品, AI人工智能产品, 物流快递, 交通/运输/物流, IT互联网,...</td>\n",
       "      <td>20230427154916</td>\n",
       "      <td>JTY-产品经理</td>\n",
       "      <td>15-25k·14薪</td>\n",
       "      <td>深圳-南山区</td>\n",
       "      <td>False</td>\n",
       "      <td>3-5年</td>\n",
       "      <td>本科</td>\n",
       "      <td>NaN</td>\n",
       "      <td>顺丰科技</td>\n",
       "      <td>互联网</td>\n",
       "      <td>2000-5000人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>38</th>\n",
       "      <td>[交易产品, IT互联网, 证券业, 团队管理]</td>\n",
       "      <td>20230815140559</td>\n",
       "      <td>交易系统服务产品经理</td>\n",
       "      <td>50-65k·18薪</td>\n",
       "      <td>深圳-福田区</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>融资未公开</td>\n",
       "      <td>某知名公司</td>\n",
       "      <td>电子商务</td>\n",
       "      <td>10000人以上</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>39</th>\n",
       "      <td>[支付产品, to G, to B, 消费券, 政府权益, 银行合作]</td>\n",
       "      <td>20230823102535</td>\n",
       "      <td>权益产品经理 (MJ001105)</td>\n",
       "      <td>14-23k</td>\n",
       "      <td>深圳</td>\n",
       "      <td>False</td>\n",
       "      <td>1-3年</td>\n",
       "      <td>本科</td>\n",
       "      <td>创业板上市</td>\n",
       "      <td>深圳市新国都股份有限公司</td>\n",
       "      <td>智能硬件</td>\n",
       "      <td>5000-10000人</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>376 rows × 12 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "                                           job.labels job.refreshTime  \\\n",
       "0   [电子/半导体/集成电路, 产品规划, 需求分析, 用户研究, 市场调研, 数据分析, 存储...  20231103092225   \n",
       "1                                      [产品经理, 食品产品经理]  20231102203839   \n",
       "2   [平台产品, 竞品分析, 策略产品, 商业产品, 产品优化, 产品规划, 产品设计, 医疗行...  20231023105635   \n",
       "3                            [数据产品, 数据挖掘, 数据分析, 产品规划]  20210311143004   \n",
       "4   [数据产品, 供应链产品, 电子/半导体/集成电路, 电子商务, 产品规划, 用户研究, 市...  20231016141158   \n",
       "..                                                ...             ...   \n",
       "34                                             [OA产品]  20230907193745   \n",
       "35  [硬件产品, 定价/竞价产品, 策略产品, 硬件系统集成, 智能硬件, 产品规划, 产品设计...  20230823112426   \n",
       "36  [供应链产品, 数据产品, AI人工智能产品, 物流快递, 交通/运输/物流, IT互联网,...  20230427154916   \n",
       "38                           [交易产品, IT互联网, 证券业, 团队管理]  20230815140559   \n",
       "39                [支付产品, to G, to B, 消费券, 政府权益, 银行合作]  20230823102535   \n",
       "\n",
       "                                   job.title  job.salary  job.dq  job.topJob  \\\n",
       "0                                     PM产品经理  20-40k·14薪  深圳-南山区        True   \n",
       "1                            产品经理 (MJ000380)        薪资面议  深圳-宝安区        True   \n",
       "2                                     高级产品经理      20-30k  深圳-光明区        True   \n",
       "3   Lazada-Data Product Manager-流量数据产品经理（深圳）        薪资面议      深圳       False   \n",
       "4                               产品经理 PM（NXP）  15-20k·15薪      深圳       False   \n",
       "..                                       ...         ...     ...         ...   \n",
       "34                              高级产品经理（OA系统）  30-45k·15薪  深圳-福田区       False   \n",
       "35                                 空气净化器产品经理      12-18k  深圳-宝安区       False   \n",
       "36                                  JTY-产品经理  15-25k·14薪  深圳-南山区       False   \n",
       "38                                交易系统服务产品经理  50-65k·18薪  深圳-福田区       False   \n",
       "39                         权益产品经理 (MJ001105)      14-23k      深圳       False   \n",
       "\n",
       "   job.requireWorkYears job.requireEduLevel comp.compStage   comp.compName  \\\n",
       "0                 5-10年                统招本科             A轮  联和存储科技(江苏)有限公司   \n",
       "1                 5-10年                  本科            NaN            卡士酸奶   \n",
       "2                 5-10年                  本科            天使轮  倍生生物科技(深圳)有限公司   \n",
       "3                 5-10年                学历不限           港股上市          阿里巴巴集团   \n",
       "4                 5-10年                  本科            NaN             中电港   \n",
       "..                  ...                 ...            ...             ...   \n",
       "34                10年以上                学历不限          不需要融资    某深圳大型批发/零售公司   \n",
       "35                 1-3年                  本科             A轮       某北京电子商务公司   \n",
       "36                 3-5年                  本科            NaN            顺丰科技   \n",
       "38                5-10年                  本科          融资未公开           某知名公司   \n",
       "39                 1-3年                  本科          创业板上市    深圳市新国都股份有限公司   \n",
       "\n",
       "   comp.compIndustry comp.compScale  \n",
       "0        电子/半导体/集成电路         50-99人  \n",
       "1           食品/饮料/酒水     2000-5000人  \n",
       "2               生物技术       100-499人  \n",
       "3                互联网       10000人以上  \n",
       "4        电子/半导体/集成电路       500-999人  \n",
       "..               ...            ...  \n",
       "34             批发/零售       10000人以上  \n",
       "35              电子商务       100-499人  \n",
       "36               互联网     2000-5000人  \n",
       "38              电子商务       10000人以上  \n",
       "39              智能硬件    5000-10000人  \n",
       "\n",
       "[376 rows x 12 columns]"
      ]
     },
     "execution_count": 9,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "we = df_PM_gz[df_PM_gz['job.requireWorkYears'].fillna('').str.strip() != '经验不限']\n",
    "we"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "5622bef3",
   "metadata": {},
   "source": [
    "# 平均薪资"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 10,
   "id": "53449ead",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>job.labels</th>\n",
       "      <th>job.refreshTime</th>\n",
       "      <th>job.title</th>\n",
       "      <th>job.salary</th>\n",
       "      <th>job.dq</th>\n",
       "      <th>job.topJob</th>\n",
       "      <th>job.requireWorkYears</th>\n",
       "      <th>job.requireEduLevel</th>\n",
       "      <th>comp.compStage</th>\n",
       "      <th>comp.compName</th>\n",
       "      <th>comp.compIndustry</th>\n",
       "      <th>comp.compScale</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>[电子/半导体/集成电路, 产品规划, 需求分析, 用户研究, 市场调研, 数据分析, 存储...</td>\n",
       "      <td>20231103092225</td>\n",
       "      <td>PM产品经理</td>\n",
       "      <td>20-40k·14薪</td>\n",
       "      <td>深圳-南山区</td>\n",
       "      <td>True</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>统招本科</td>\n",
       "      <td>A轮</td>\n",
       "      <td>联和存储科技(江苏)有限公司</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>50-99人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>[平台产品, 竞品分析, 策略产品, 商业产品, 产品优化, 产品规划, 产品设计, 医疗行...</td>\n",
       "      <td>20231023105635</td>\n",
       "      <td>高级产品经理</td>\n",
       "      <td>20-30k</td>\n",
       "      <td>深圳-光明区</td>\n",
       "      <td>True</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>天使轮</td>\n",
       "      <td>倍生生物科技(深圳)有限公司</td>\n",
       "      <td>生物技术</td>\n",
       "      <td>100-499人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>[数据产品, 供应链产品, 电子/半导体/集成电路, 电子商务, 产品规划, 用户研究, 市...</td>\n",
       "      <td>20231016141158</td>\n",
       "      <td>产品经理 PM（NXP）</td>\n",
       "      <td>15-20k·15薪</td>\n",
       "      <td>深圳</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>NaN</td>\n",
       "      <td>中电港</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>500-999人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>6</th>\n",
       "      <td>[功率器件]</td>\n",
       "      <td>20231104101628</td>\n",
       "      <td>产品经理PM</td>\n",
       "      <td>20-40k·13薪</td>\n",
       "      <td>深圳-宝安区</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>A轮</td>\n",
       "      <td>某深圳大型电子/半导体/集成电路公司</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>500-999人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>7</th>\n",
       "      <td>[产品规划, 市场调研, 用户研究, 电子商务]</td>\n",
       "      <td>20230928114733</td>\n",
       "      <td>PM产品经理</td>\n",
       "      <td>13-16k</td>\n",
       "      <td>深圳-南山区</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>大专</td>\n",
       "      <td>NaN</td>\n",
       "      <td>深圳中积芯科技有限公司</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>1-49人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>34</th>\n",
       "      <td>[OA产品]</td>\n",
       "      <td>20230907193745</td>\n",
       "      <td>高级产品经理（OA系统）</td>\n",
       "      <td>30-45k·15薪</td>\n",
       "      <td>深圳-福田区</td>\n",
       "      <td>False</td>\n",
       "      <td>10年以上</td>\n",
       "      <td>学历不限</td>\n",
       "      <td>不需要融资</td>\n",
       "      <td>某深圳大型批发/零售公司</td>\n",
       "      <td>批发/零售</td>\n",
       "      <td>10000人以上</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>35</th>\n",
       "      <td>[硬件产品, 定价/竞价产品, 策略产品, 硬件系统集成, 智能硬件, 产品规划, 产品设计...</td>\n",
       "      <td>20230823112426</td>\n",
       "      <td>空气净化器产品经理</td>\n",
       "      <td>12-18k</td>\n",
       "      <td>深圳-宝安区</td>\n",
       "      <td>False</td>\n",
       "      <td>1-3年</td>\n",
       "      <td>本科</td>\n",
       "      <td>A轮</td>\n",
       "      <td>某北京电子商务公司</td>\n",
       "      <td>电子商务</td>\n",
       "      <td>100-499人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>36</th>\n",
       "      <td>[供应链产品, 数据产品, AI人工智能产品, 物流快递, 交通/运输/物流, IT互联网,...</td>\n",
       "      <td>20230427154916</td>\n",
       "      <td>JTY-产品经理</td>\n",
       "      <td>15-25k·14薪</td>\n",
       "      <td>深圳-南山区</td>\n",
       "      <td>False</td>\n",
       "      <td>3-5年</td>\n",
       "      <td>本科</td>\n",
       "      <td>NaN</td>\n",
       "      <td>顺丰科技</td>\n",
       "      <td>互联网</td>\n",
       "      <td>2000-5000人</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>38</th>\n",
       "      <td>[交易产品, IT互联网, 证券业, 团队管理]</td>\n",
       "      <td>20230815140559</td>\n",
       "      <td>交易系统服务产品经理</td>\n",
       "      <td>50-65k·18薪</td>\n",
       "      <td>深圳-福田区</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>融资未公开</td>\n",
       "      <td>某知名公司</td>\n",
       "      <td>电子商务</td>\n",
       "      <td>10000人以上</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>39</th>\n",
       "      <td>[支付产品, to G, to B, 消费券, 政府权益, 银行合作]</td>\n",
       "      <td>20230823102535</td>\n",
       "      <td>权益产品经理 (MJ001105)</td>\n",
       "      <td>14-23k</td>\n",
       "      <td>深圳</td>\n",
       "      <td>False</td>\n",
       "      <td>1-3年</td>\n",
       "      <td>本科</td>\n",
       "      <td>创业板上市</td>\n",
       "      <td>深圳市新国都股份有限公司</td>\n",
       "      <td>智能硬件</td>\n",
       "      <td>5000-10000人</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>348 rows × 12 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "                                           job.labels job.refreshTime  \\\n",
       "0   [电子/半导体/集成电路, 产品规划, 需求分析, 用户研究, 市场调研, 数据分析, 存储...  20231103092225   \n",
       "2   [平台产品, 竞品分析, 策略产品, 商业产品, 产品优化, 产品规划, 产品设计, 医疗行...  20231023105635   \n",
       "4   [数据产品, 供应链产品, 电子/半导体/集成电路, 电子商务, 产品规划, 用户研究, 市...  20231016141158   \n",
       "6                                              [功率器件]  20231104101628   \n",
       "7                            [产品规划, 市场调研, 用户研究, 电子商务]  20230928114733   \n",
       "..                                                ...             ...   \n",
       "34                                             [OA产品]  20230907193745   \n",
       "35  [硬件产品, 定价/竞价产品, 策略产品, 硬件系统集成, 智能硬件, 产品规划, 产品设计...  20230823112426   \n",
       "36  [供应链产品, 数据产品, AI人工智能产品, 物流快递, 交通/运输/物流, IT互联网,...  20230427154916   \n",
       "38                           [交易产品, IT互联网, 证券业, 团队管理]  20230815140559   \n",
       "39                [支付产品, to G, to B, 消费券, 政府权益, 银行合作]  20230823102535   \n",
       "\n",
       "            job.title  job.salary  job.dq  job.topJob job.requireWorkYears  \\\n",
       "0              PM产品经理  20-40k·14薪  深圳-南山区        True                5-10年   \n",
       "2              高级产品经理      20-30k  深圳-光明区        True                5-10年   \n",
       "4        产品经理 PM（NXP）  15-20k·15薪      深圳       False                5-10年   \n",
       "6              产品经理PM  20-40k·13薪  深圳-宝安区       False                5-10年   \n",
       "7              PM产品经理      13-16k  深圳-南山区       False                5-10年   \n",
       "..                ...         ...     ...         ...                  ...   \n",
       "34       高级产品经理（OA系统）  30-45k·15薪  深圳-福田区       False                10年以上   \n",
       "35          空气净化器产品经理      12-18k  深圳-宝安区       False                 1-3年   \n",
       "36           JTY-产品经理  15-25k·14薪  深圳-南山区       False                 3-5年   \n",
       "38         交易系统服务产品经理  50-65k·18薪  深圳-福田区       False                5-10年   \n",
       "39  权益产品经理 (MJ001105)      14-23k      深圳       False                 1-3年   \n",
       "\n",
       "   job.requireEduLevel comp.compStage       comp.compName comp.compIndustry  \\\n",
       "0                 统招本科             A轮      联和存储科技(江苏)有限公司       电子/半导体/集成电路   \n",
       "2                   本科            天使轮      倍生生物科技(深圳)有限公司              生物技术   \n",
       "4                   本科            NaN                 中电港       电子/半导体/集成电路   \n",
       "6                   本科             A轮  某深圳大型电子/半导体/集成电路公司       电子/半导体/集成电路   \n",
       "7                   大专            NaN         深圳中积芯科技有限公司       电子/半导体/集成电路   \n",
       "..                 ...            ...                 ...               ...   \n",
       "34                学历不限          不需要融资        某深圳大型批发/零售公司             批发/零售   \n",
       "35                  本科             A轮           某北京电子商务公司              电子商务   \n",
       "36                  本科            NaN                顺丰科技               互联网   \n",
       "38                  本科          融资未公开               某知名公司              电子商务   \n",
       "39                  本科          创业板上市        深圳市新国都股份有限公司              智能硬件   \n",
       "\n",
       "   comp.compScale  \n",
       "0          50-99人  \n",
       "2        100-499人  \n",
       "4        500-999人  \n",
       "6        500-999人  \n",
       "7           1-49人  \n",
       "..            ...  \n",
       "34       10000人以上  \n",
       "35       100-499人  \n",
       "36     2000-5000人  \n",
       "38       10000人以上  \n",
       "39    5000-10000人  \n",
       "\n",
       "[348 rows x 12 columns]"
      ]
     },
     "execution_count": 10,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "非薪资面议=we[ ~we['job.salary'].str.contains(\"薪资面议\"and \"面议|元/天\") ]\n",
    "\n",
    "非薪资面议"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 11,
   "id": "ca0f2d89",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "[['20-40k', '14'],\n",
       " ['20-30k'],\n",
       " ['15-20k', '15'],\n",
       " ['20-40k', '13'],\n",
       " ['13-16k'],\n",
       " ['20-50k'],\n",
       " ['12-20k', '13'],\n",
       " ['10-20k', '13'],\n",
       " ['15-20k'],\n",
       " ['15-30k'],\n",
       " ['14-20k'],\n",
       " ['20-40k'],\n",
       " ['10-15k'],\n",
       " ['23-35k', '15'],\n",
       " ['15-25k', '13'],\n",
       " ['10-20k'],\n",
       " ['15-25k'],\n",
       " ['15-30k', '16'],\n",
       " ['15-25k'],\n",
       " ['10-12k'],\n",
       " ['8-15k'],\n",
       " ['15-25k', '13'],\n",
       " ['10-30k'],\n",
       " ['20-50k'],\n",
       " ['8-15k'],\n",
       " ['15-25k'],\n",
       " ['25-30k', '16'],\n",
       " ['20-30k', '13'],\n",
       " ['10-25k'],\n",
       " ['10-20k'],\n",
       " ['20-40k', '13'],\n",
       " ['20-40k'],\n",
       " ['25-40k', '14'],\n",
       " ['20-40k', '13'],\n",
       " ['15-25k', '18'],\n",
       " ['13-18k'],\n",
       " ['40-60k'],\n",
       " ['15-25k'],\n",
       " ['10-15k'],\n",
       " ['18-25k', '13'],\n",
       " ['20-30k'],\n",
       " ['15-30k', '13'],\n",
       " ['15-30k', '16'],\n",
       " ['10-20k', '13'],\n",
       " ['20-35k', '13'],\n",
       " ['10-18k'],\n",
       " ['10-15k'],\n",
       " ['25-45k', '14'],\n",
       " ['10-20k', '13'],\n",
       " ['15-30k', '16'],\n",
       " ['12-20k'],\n",
       " ['25-50k', '13'],\n",
       " ['20-40k', '14'],\n",
       " ['10-30k', '13'],\n",
       " ['15-18k'],\n",
       " ['10-15k'],\n",
       " ['10-15k'],\n",
       " ['15-20k'],\n",
       " ['12-24k'],\n",
       " ['12-15k'],\n",
       " ['8-15k'],\n",
       " ['8-13k'],\n",
       " ['12-15k'],\n",
       " ['15-25k', '16'],\n",
       " ['15-20k', '18'],\n",
       " ['15-30k'],\n",
       " ['20-35k'],\n",
       " ['15-25k'],\n",
       " ['18-28k'],\n",
       " ['12-18k'],\n",
       " ['15-20k', '17'],\n",
       " ['17-25k', '13'],\n",
       " ['15-25k'],\n",
       " ['15-25k', '14'],\n",
       " ['10-30k', '13'],\n",
       " ['25-40k'],\n",
       " ['10-15k'],\n",
       " ['20-23k'],\n",
       " ['18-25k'],\n",
       " ['15-20k', '13'],\n",
       " ['13-18k', '13'],\n",
       " ['13-20k'],\n",
       " ['10-20k', '14'],\n",
       " ['8-12k'],\n",
       " ['8-16k', '13'],\n",
       " ['20-33k'],\n",
       " ['15-25k'],\n",
       " ['110-180k', '14'],\n",
       " ['13-26k'],\n",
       " ['15-30k', '14'],\n",
       " ['20-40k'],\n",
       " ['20-40k'],\n",
       " ['15-25k'],\n",
       " ['12-24k'],\n",
       " ['9-10k', '15'],\n",
       " ['10-15k', '13'],\n",
       " ['8-10k'],\n",
       " ['20-30k'],\n",
       " ['20-40k', '13'],\n",
       " ['65-85k', '14'],\n",
       " ['30-45k', '15'],\n",
       " ['15-30k', '14'],\n",
       " ['18-22k'],\n",
       " ['15-25k'],\n",
       " ['20-30k'],\n",
       " ['18-40k', '14'],\n",
       " ['25-35k', '14'],\n",
       " ['40-70k', '18'],\n",
       " ['25-40k'],\n",
       " ['25-55k', '14'],\n",
       " ['15-30k', '13'],\n",
       " ['15-20k', '18'],\n",
       " ['15-25k', '15'],\n",
       " ['25-45k', '13'],\n",
       " ['15-25k', '15'],\n",
       " ['15-25k', '13'],\n",
       " ['12-20k', '13'],\n",
       " ['16-35k'],\n",
       " ['20-30k', '14'],\n",
       " ['18-25k', '13'],\n",
       " ['25-40k', '15'],\n",
       " ['15-25k'],\n",
       " ['18-26k'],\n",
       " ['15-25k'],\n",
       " ['12-25k', '16'],\n",
       " ['20-40k'],\n",
       " ['25-50k'],\n",
       " ['20-30k'],\n",
       " ['15-25k'],\n",
       " ['15-25k'],\n",
       " ['15-20k'],\n",
       " ['20-30k'],\n",
       " ['20-30k'],\n",
       " ['20-40k', '16'],\n",
       " ['10-20k'],\n",
       " ['15-35k', '13'],\n",
       " ['10-13k'],\n",
       " ['5-7k'],\n",
       " ['15-30k', '16'],\n",
       " ['20-40k'],\n",
       " ['15-25k', '13'],\n",
       " ['10-11k'],\n",
       " ['15-25k'],\n",
       " ['12-20k'],\n",
       " ['20-25k'],\n",
       " ['15-20k', '14'],\n",
       " ['15-25k'],\n",
       " ['10-15k'],\n",
       " ['30-50k', '15'],\n",
       " ['20-35k', '14'],\n",
       " ['20-40k'],\n",
       " ['13-20k'],\n",
       " ['40-65k', '15'],\n",
       " ['40-45k', '15'],\n",
       " ['25-55k', '16'],\n",
       " ['30-50k', '14'],\n",
       " ['15-25k', '18'],\n",
       " ['25-30k'],\n",
       " ['20-40k'],\n",
       " ['15-16k'],\n",
       " ['15-20k'],\n",
       " ['15-30k'],\n",
       " ['22-30k', '16'],\n",
       " ['12-28k'],\n",
       " ['15-25k', '13'],\n",
       " ['15-25k'],\n",
       " ['20-35k'],\n",
       " ['12-20k', '13'],\n",
       " ['20-35k'],\n",
       " ['25-35k', '13'],\n",
       " ['20-30k', '14'],\n",
       " ['15-25k'],\n",
       " ['18-25k'],\n",
       " ['15-25k', '13'],\n",
       " ['15-25k'],\n",
       " ['12-21k'],\n",
       " ['11-22k'],\n",
       " ['20-25k', '13'],\n",
       " ['8-20k'],\n",
       " ['15-20k'],\n",
       " ['15-25k'],\n",
       " ['14-25k', '15'],\n",
       " ['18-30k'],\n",
       " ['15-20k', '13'],\n",
       " ['20-35k', '13'],\n",
       " ['10-15k'],\n",
       " ['10-15k'],\n",
       " ['15-20k', '13'],\n",
       " ['10-15k'],\n",
       " ['15-20k', '13'],\n",
       " ['10-20k', '13'],\n",
       " ['20-28k', '14'],\n",
       " ['15-20k'],\n",
       " ['11-18k'],\n",
       " ['15-25k'],\n",
       " ['40-70k'],\n",
       " ['30-60k', '16'],\n",
       " ['12-20k'],\n",
       " ['20-30k', '13'],\n",
       " ['40-60k', '14'],\n",
       " ['18-30k'],\n",
       " ['20-30k', '13'],\n",
       " ['25-50k'],\n",
       " ['25-45k'],\n",
       " ['12-16k'],\n",
       " ['20-30k', '14'],\n",
       " ['25-50k', '16'],\n",
       " ['30-60k', '18'],\n",
       " ['30-60k', '15'],\n",
       " ['40-70k'],\n",
       " ['20-35k'],\n",
       " ['12-18k'],\n",
       " ['15-30k'],\n",
       " ['45-75k'],\n",
       " ['8-10k'],\n",
       " ['10-13k'],\n",
       " ['8-10k'],\n",
       " ['80-110k', '18'],\n",
       " ['40-70k', '16'],\n",
       " ['15-25k'],\n",
       " ['25-45k', '16'],\n",
       " ['30-40k', '13'],\n",
       " ['15-23k'],\n",
       " ['15-20k'],\n",
       " ['30-50k'],\n",
       " ['45-50k'],\n",
       " ['25-40k', '15'],\n",
       " ['50-70k'],\n",
       " ['20-30k', '13'],\n",
       " ['35-65k', '15'],\n",
       " ['25-50k', '16'],\n",
       " ['18-20k'],\n",
       " ['40-50k'],\n",
       " ['20-30k'],\n",
       " ['30-50k', '14'],\n",
       " ['15-25k', '14'],\n",
       " ['20-30k'],\n",
       " ['30-50k'],\n",
       " ['30-50k', '14'],\n",
       " ['15-25k'],\n",
       " ['14-18k'],\n",
       " ['80-110k', '15'],\n",
       " ['30-50k', '13'],\n",
       " ['18-30k'],\n",
       " ['12-20k'],\n",
       " ['25-35k'],\n",
       " ['25-40k'],\n",
       " ['30-60k'],\n",
       " ['15-25k'],\n",
       " ['7-18k'],\n",
       " ['15-25k'],\n",
       " ['18-25k', '13'],\n",
       " ['15-30k'],\n",
       " ['30-60k', '15'],\n",
       " ['20-25k'],\n",
       " ['15-25k'],\n",
       " ['30-45k', '15'],\n",
       " ['20-40k', '13'],\n",
       " ['25-50k'],\n",
       " ['40-70k', '16'],\n",
       " ['25-50k'],\n",
       " ['50-80k'],\n",
       " ['30-50k'],\n",
       " ['20-35k'],\n",
       " ['15-30k'],\n",
       " ['30-50k'],\n",
       " ['13-25k'],\n",
       " ['20-35k'],\n",
       " ['15-30k', '15'],\n",
       " ['10-15k', '13'],\n",
       " ['25-30k', '13'],\n",
       " ['25-55k'],\n",
       " ['20-40k', '15'],\n",
       " ['20-40k'],\n",
       " ['25-40k', '13'],\n",
       " ['35-45k', '15'],\n",
       " ['30-40k'],\n",
       " ['30-50k'],\n",
       " ['30-55k', '15'],\n",
       " ['15-30k', '13'],\n",
       " ['30-55k', '14'],\n",
       " ['15-20k'],\n",
       " ['23-40k'],\n",
       " ['30-50k', '14'],\n",
       " ['20-30k', '13'],\n",
       " ['18-25k', '18'],\n",
       " ['30-45k', '14'],\n",
       " ['30-50k'],\n",
       " ['10-20k'],\n",
       " ['35-40k', '13'],\n",
       " ['15-25k'],\n",
       " ['20-30k', '13'],\n",
       " ['65-95k'],\n",
       " ['15-25k'],\n",
       " ['75-100k'],\n",
       " ['18-23k', '13'],\n",
       " ['15-30k'],\n",
       " ['25-35k', '15'],\n",
       " ['15-20k'],\n",
       " ['30-60k', '16'],\n",
       " ['10-30k'],\n",
       " ['25-50k', '14'],\n",
       " ['60-80k'],\n",
       " ['25-40k', '14'],\n",
       " ['30-35k', '15'],\n",
       " ['25-28k'],\n",
       " ['15-30k', '14'],\n",
       " ['30-40k', '13'],\n",
       " ['18-23k', '16'],\n",
       " ['25-30k', '13'],\n",
       " ['30-35k'],\n",
       " ['10-15k', '14'],\n",
       " ['30-45k'],\n",
       " ['20-40k'],\n",
       " ['25-40k', '14'],\n",
       " ['25-50k', '13'],\n",
       " ['20-25k'],\n",
       " ['20-25k', '15'],\n",
       " ['20-35k', '13'],\n",
       " ['30-60k', '16'],\n",
       " ['15-25k'],\n",
       " ['15-40k'],\n",
       " ['10-30k', '16'],\n",
       " ['15-30k'],\n",
       " ['20-40k', '16'],\n",
       " ['21-25k'],\n",
       " ['45-60k', '20'],\n",
       " ['30-45k'],\n",
       " ['35-50k', '13'],\n",
       " ['15-30k', '14'],\n",
       " ['15-30k'],\n",
       " ['35-50k'],\n",
       " ['15-25k'],\n",
       " ['30-60k', '16'],\n",
       " ['15-25k'],\n",
       " ['40-55k', '15'],\n",
       " ['15-20k', '16'],\n",
       " ['35-45k'],\n",
       " ['40-70k', '17'],\n",
       " ['20-35k', '13'],\n",
       " ['35-55k', '14'],\n",
       " ['25-30k', '13'],\n",
       " ['28-40k', '14'],\n",
       " ['30-45k', '15'],\n",
       " ['12-18k'],\n",
       " ['15-25k', '14'],\n",
       " ['50-65k', '18'],\n",
       " ['14-23k']]"
      ]
     },
     "execution_count": 11,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "非薪资面议_detail = 非薪资面议['job.salary'].apply(lambda x:x.split('薪')[0].split('·')).tolist()\n",
    "非薪资面议_detail"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 12,
   "id": "cf481588",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "[35.0,\n",
       " 25.0,\n",
       " 21.9,\n",
       " 32.5,\n",
       " 14.5,\n",
       " 35.0,\n",
       " 17.3,\n",
       " 16.2,\n",
       " 17.5,\n",
       " 22.5,\n",
       " 17.0,\n",
       " 30.0,\n",
       " 12.5,\n",
       " 36.2,\n",
       " 21.7,\n",
       " 15.0,\n",
       " 20.0,\n",
       " 30.0,\n",
       " 20.0,\n",
       " 11.0,\n",
       " 11.5,\n",
       " 21.7,\n",
       " 20.0,\n",
       " 35.0,\n",
       " 11.5,\n",
       " 20.0,\n",
       " 36.7,\n",
       " 27.1,\n",
       " 17.5,\n",
       " 15.0,\n",
       " 32.5,\n",
       " 30.0,\n",
       " 37.9,\n",
       " 32.5,\n",
       " 30.0,\n",
       " 15.5,\n",
       " 50.0,\n",
       " 20.0,\n",
       " 12.5,\n",
       " 23.3,\n",
       " 25.0,\n",
       " 24.4,\n",
       " 30.0,\n",
       " 16.2,\n",
       " 29.8,\n",
       " 14.0,\n",
       " 12.5,\n",
       " 40.8,\n",
       " 16.2,\n",
       " 30.0,\n",
       " 16.0,\n",
       " 40.6,\n",
       " 35.0,\n",
       " 21.7,\n",
       " 16.5,\n",
       " 12.5,\n",
       " 12.5,\n",
       " 17.5,\n",
       " 18.0,\n",
       " 13.5,\n",
       " 11.5,\n",
       " 10.5,\n",
       " 13.5,\n",
       " 26.7,\n",
       " 26.2,\n",
       " 22.5,\n",
       " 27.5,\n",
       " 20.0,\n",
       " 23.0,\n",
       " 15.0,\n",
       " 24.8,\n",
       " 22.8,\n",
       " 20.0,\n",
       " 23.3,\n",
       " 21.7,\n",
       " 32.5,\n",
       " 12.5,\n",
       " 21.5,\n",
       " 21.5,\n",
       " 19.0,\n",
       " 16.8,\n",
       " 16.5,\n",
       " 17.5,\n",
       " 10.0,\n",
       " 13.0,\n",
       " 26.5,\n",
       " 20.0,\n",
       " 169.2,\n",
       " 19.5,\n",
       " 26.2,\n",
       " 30.0,\n",
       " 30.0,\n",
       " 20.0,\n",
       " 18.0,\n",
       " 11.9,\n",
       " 13.5,\n",
       " 9.0,\n",
       " 25.0,\n",
       " 32.5,\n",
       " 87.5,\n",
       " 46.9,\n",
       " 26.2,\n",
       " 20.0,\n",
       " 20.0,\n",
       " 25.0,\n",
       " 33.8,\n",
       " 35.0,\n",
       " 82.5,\n",
       " 32.5,\n",
       " 46.7,\n",
       " 24.4,\n",
       " 26.2,\n",
       " 25.0,\n",
       " 37.9,\n",
       " 25.0,\n",
       " 21.7,\n",
       " 17.3,\n",
       " 25.5,\n",
       " 29.2,\n",
       " 23.3,\n",
       " 40.6,\n",
       " 20.0,\n",
       " 22.0,\n",
       " 20.0,\n",
       " 24.7,\n",
       " 30.0,\n",
       " 37.5,\n",
       " 25.0,\n",
       " 20.0,\n",
       " 20.0,\n",
       " 17.5,\n",
       " 25.0,\n",
       " 25.0,\n",
       " 40.0,\n",
       " 15.0,\n",
       " 27.1,\n",
       " 11.5,\n",
       " 6.0,\n",
       " 30.0,\n",
       " 30.0,\n",
       " 21.7,\n",
       " 10.5,\n",
       " 20.0,\n",
       " 16.0,\n",
       " 22.5,\n",
       " 20.4,\n",
       " 20.0,\n",
       " 12.5,\n",
       " 50.0,\n",
       " 32.1,\n",
       " 30.0,\n",
       " 16.5,\n",
       " 65.6,\n",
       " 53.1,\n",
       " 53.3,\n",
       " 46.7,\n",
       " 30.0,\n",
       " 27.5,\n",
       " 30.0,\n",
       " 15.5,\n",
       " 17.5,\n",
       " 22.5,\n",
       " 34.7,\n",
       " 20.0,\n",
       " 21.7,\n",
       " 20.0,\n",
       " 27.5,\n",
       " 17.3,\n",
       " 27.5,\n",
       " 32.5,\n",
       " 29.2,\n",
       " 20.0,\n",
       " 21.5,\n",
       " 21.7,\n",
       " 20.0,\n",
       " 16.5,\n",
       " 16.5,\n",
       " 24.4,\n",
       " 14.0,\n",
       " 17.5,\n",
       " 20.0,\n",
       " 24.4,\n",
       " 24.0,\n",
       " 19.0,\n",
       " 29.8,\n",
       " 12.5,\n",
       " 12.5,\n",
       " 19.0,\n",
       " 12.5,\n",
       " 19.0,\n",
       " 16.2,\n",
       " 28.0,\n",
       " 17.5,\n",
       " 14.5,\n",
       " 20.0,\n",
       " 55.0,\n",
       " 60.0,\n",
       " 16.0,\n",
       " 27.1,\n",
       " 58.3,\n",
       " 24.0,\n",
       " 27.1,\n",
       " 37.5,\n",
       " 35.0,\n",
       " 14.0,\n",
       " 29.2,\n",
       " 50.0,\n",
       " 67.5,\n",
       " 56.2,\n",
       " 55.0,\n",
       " 27.5,\n",
       " 15.0,\n",
       " 22.5,\n",
       " 60.0,\n",
       " 9.0,\n",
       " 11.5,\n",
       " 9.0,\n",
       " 142.5,\n",
       " 73.3,\n",
       " 20.0,\n",
       " 46.7,\n",
       " 37.9,\n",
       " 19.0,\n",
       " 17.5,\n",
       " 40.0,\n",
       " 47.5,\n",
       " 40.6,\n",
       " 60.0,\n",
       " 27.1,\n",
       " 62.5,\n",
       " 50.0,\n",
       " 19.0,\n",
       " 45.0,\n",
       " 25.0,\n",
       " 46.7,\n",
       " 23.3,\n",
       " 25.0,\n",
       " 40.0,\n",
       " 46.7,\n",
       " 20.0,\n",
       " 16.0,\n",
       " 118.8,\n",
       " 43.3,\n",
       " 24.0,\n",
       " 16.0,\n",
       " 30.0,\n",
       " 32.5,\n",
       " 45.0,\n",
       " 20.0,\n",
       " 12.5,\n",
       " 20.0,\n",
       " 23.3,\n",
       " 22.5,\n",
       " 56.2,\n",
       " 22.5,\n",
       " 20.0,\n",
       " 46.9,\n",
       " 32.5,\n",
       " 37.5,\n",
       " 73.3,\n",
       " 37.5,\n",
       " 65.0,\n",
       " 40.0,\n",
       " 27.5,\n",
       " 22.5,\n",
       " 40.0,\n",
       " 19.0,\n",
       " 27.5,\n",
       " 28.1,\n",
       " 13.5,\n",
       " 29.8,\n",
       " 40.0,\n",
       " 37.5,\n",
       " 30.0,\n",
       " 35.2,\n",
       " 50.0,\n",
       " 35.0,\n",
       " 40.0,\n",
       " 53.1,\n",
       " 24.4,\n",
       " 49.6,\n",
       " 17.5,\n",
       " 31.5,\n",
       " 46.7,\n",
       " 27.1,\n",
       " 32.2,\n",
       " 43.8,\n",
       " 40.0,\n",
       " 15.0,\n",
       " 40.6,\n",
       " 20.0,\n",
       " 27.1,\n",
       " 80.0,\n",
       " 20.0,\n",
       " 87.5,\n",
       " 22.2,\n",
       " 22.5,\n",
       " 37.5,\n",
       " 17.5,\n",
       " 60.0,\n",
       " 20.0,\n",
       " 43.8,\n",
       " 70.0,\n",
       " 37.9,\n",
       " 40.6,\n",
       " 26.5,\n",
       " 26.2,\n",
       " 37.9,\n",
       " 27.3,\n",
       " 29.8,\n",
       " 32.5,\n",
       " 14.6,\n",
       " 37.5,\n",
       " 30.0,\n",
       " 37.9,\n",
       " 40.6,\n",
       " 22.5,\n",
       " 28.1,\n",
       " 29.8,\n",
       " 60.0,\n",
       " 20.0,\n",
       " 27.5,\n",
       " 26.7,\n",
       " 22.5,\n",
       " 40.0,\n",
       " 23.0,\n",
       " 87.5,\n",
       " 37.5,\n",
       " 46.0,\n",
       " 26.2,\n",
       " 22.5,\n",
       " 42.5,\n",
       " 20.0,\n",
       " 60.0,\n",
       " 20.0,\n",
       " 59.4,\n",
       " 23.3,\n",
       " 40.0,\n",
       " 77.9,\n",
       " 29.8,\n",
       " 52.5,\n",
       " 29.8,\n",
       " 39.7,\n",
       " 46.9,\n",
       " 15.0,\n",
       " 23.3,\n",
       " 86.2,\n",
       " 18.5]"
      ]
     },
     "execution_count": 12,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "平均薪资 = [ (int(i[0].split('-')[0]) +int(i[0].split('-')[1].split('k')[0]))/2    \\\n",
    " if len(i)==1 else round((int(i[0].split('-')[0]) +int(i[0].split('-')[1].split('k')[0]))/2*int(i[1])/12,1)     \\\n",
    " for i in 非薪资面议_detail        ] \n",
    "平均薪资"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 13,
   "id": "dbe1802f",
   "metadata": {},
   "outputs": [
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      "C:\\Users\\86135\\AppData\\Local\\Temp\\ipykernel_5180\\3673931196.py:1: SettingWithCopyWarning: \n",
      "A value is trying to be set on a copy of a slice from a DataFrame.\n",
      "Try using .loc[row_indexer,col_indexer] = value instead\n",
      "\n",
      "See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy\n",
      "  非薪资面议['平均薪资']=平均薪资\n"
     ]
    },
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>job.labels</th>\n",
       "      <th>job.refreshTime</th>\n",
       "      <th>job.title</th>\n",
       "      <th>job.salary</th>\n",
       "      <th>job.dq</th>\n",
       "      <th>job.topJob</th>\n",
       "      <th>job.requireWorkYears</th>\n",
       "      <th>job.requireEduLevel</th>\n",
       "      <th>comp.compStage</th>\n",
       "      <th>comp.compName</th>\n",
       "      <th>comp.compIndustry</th>\n",
       "      <th>comp.compScale</th>\n",
       "      <th>平均薪资</th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>0</th>\n",
       "      <td>[电子/半导体/集成电路, 产品规划, 需求分析, 用户研究, 市场调研, 数据分析, 存储...</td>\n",
       "      <td>20231103092225</td>\n",
       "      <td>PM产品经理</td>\n",
       "      <td>20-40k·14薪</td>\n",
       "      <td>深圳-南山区</td>\n",
       "      <td>True</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>统招本科</td>\n",
       "      <td>A轮</td>\n",
       "      <td>联和存储科技(江苏)有限公司</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>50-99人</td>\n",
       "      <td>35.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2</th>\n",
       "      <td>[平台产品, 竞品分析, 策略产品, 商业产品, 产品优化, 产品规划, 产品设计, 医疗行...</td>\n",
       "      <td>20231023105635</td>\n",
       "      <td>高级产品经理</td>\n",
       "      <td>20-30k</td>\n",
       "      <td>深圳-光明区</td>\n",
       "      <td>True</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>天使轮</td>\n",
       "      <td>倍生生物科技(深圳)有限公司</td>\n",
       "      <td>生物技术</td>\n",
       "      <td>100-499人</td>\n",
       "      <td>25.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>4</th>\n",
       "      <td>[数据产品, 供应链产品, 电子/半导体/集成电路, 电子商务, 产品规划, 用户研究, 市...</td>\n",
       "      <td>20231016141158</td>\n",
       "      <td>产品经理 PM（NXP）</td>\n",
       "      <td>15-20k·15薪</td>\n",
       "      <td>深圳</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>NaN</td>\n",
       "      <td>中电港</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>500-999人</td>\n",
       "      <td>21.9</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>6</th>\n",
       "      <td>[功率器件]</td>\n",
       "      <td>20231104101628</td>\n",
       "      <td>产品经理PM</td>\n",
       "      <td>20-40k·13薪</td>\n",
       "      <td>深圳-宝安区</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>A轮</td>\n",
       "      <td>某深圳大型电子/半导体/集成电路公司</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>500-999人</td>\n",
       "      <td>32.5</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>7</th>\n",
       "      <td>[产品规划, 市场调研, 用户研究, 电子商务]</td>\n",
       "      <td>20230928114733</td>\n",
       "      <td>PM产品经理</td>\n",
       "      <td>13-16k</td>\n",
       "      <td>深圳-南山区</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>大专</td>\n",
       "      <td>NaN</td>\n",
       "      <td>深圳中积芯科技有限公司</td>\n",
       "      <td>电子/半导体/集成电路</td>\n",
       "      <td>1-49人</td>\n",
       "      <td>14.5</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>...</th>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>34</th>\n",
       "      <td>[OA产品]</td>\n",
       "      <td>20230907193745</td>\n",
       "      <td>高级产品经理（OA系统）</td>\n",
       "      <td>30-45k·15薪</td>\n",
       "      <td>深圳-福田区</td>\n",
       "      <td>False</td>\n",
       "      <td>10年以上</td>\n",
       "      <td>学历不限</td>\n",
       "      <td>不需要融资</td>\n",
       "      <td>某深圳大型批发/零售公司</td>\n",
       "      <td>批发/零售</td>\n",
       "      <td>10000人以上</td>\n",
       "      <td>46.9</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>35</th>\n",
       "      <td>[硬件产品, 定价/竞价产品, 策略产品, 硬件系统集成, 智能硬件, 产品规划, 产品设计...</td>\n",
       "      <td>20230823112426</td>\n",
       "      <td>空气净化器产品经理</td>\n",
       "      <td>12-18k</td>\n",
       "      <td>深圳-宝安区</td>\n",
       "      <td>False</td>\n",
       "      <td>1-3年</td>\n",
       "      <td>本科</td>\n",
       "      <td>A轮</td>\n",
       "      <td>某北京电子商务公司</td>\n",
       "      <td>电子商务</td>\n",
       "      <td>100-499人</td>\n",
       "      <td>15.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>36</th>\n",
       "      <td>[供应链产品, 数据产品, AI人工智能产品, 物流快递, 交通/运输/物流, IT互联网,...</td>\n",
       "      <td>20230427154916</td>\n",
       "      <td>JTY-产品经理</td>\n",
       "      <td>15-25k·14薪</td>\n",
       "      <td>深圳-南山区</td>\n",
       "      <td>False</td>\n",
       "      <td>3-5年</td>\n",
       "      <td>本科</td>\n",
       "      <td>NaN</td>\n",
       "      <td>顺丰科技</td>\n",
       "      <td>互联网</td>\n",
       "      <td>2000-5000人</td>\n",
       "      <td>23.3</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>38</th>\n",
       "      <td>[交易产品, IT互联网, 证券业, 团队管理]</td>\n",
       "      <td>20230815140559</td>\n",
       "      <td>交易系统服务产品经理</td>\n",
       "      <td>50-65k·18薪</td>\n",
       "      <td>深圳-福田区</td>\n",
       "      <td>False</td>\n",
       "      <td>5-10年</td>\n",
       "      <td>本科</td>\n",
       "      <td>融资未公开</td>\n",
       "      <td>某知名公司</td>\n",
       "      <td>电子商务</td>\n",
       "      <td>10000人以上</td>\n",
       "      <td>86.2</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>39</th>\n",
       "      <td>[支付产品, to G, to B, 消费券, 政府权益, 银行合作]</td>\n",
       "      <td>20230823102535</td>\n",
       "      <td>权益产品经理 (MJ001105)</td>\n",
       "      <td>14-23k</td>\n",
       "      <td>深圳</td>\n",
       "      <td>False</td>\n",
       "      <td>1-3年</td>\n",
       "      <td>本科</td>\n",
       "      <td>创业板上市</td>\n",
       "      <td>深圳市新国都股份有限公司</td>\n",
       "      <td>智能硬件</td>\n",
       "      <td>5000-10000人</td>\n",
       "      <td>18.5</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>348 rows × 13 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "                                           job.labels job.refreshTime  \\\n",
       "0   [电子/半导体/集成电路, 产品规划, 需求分析, 用户研究, 市场调研, 数据分析, 存储...  20231103092225   \n",
       "2   [平台产品, 竞品分析, 策略产品, 商业产品, 产品优化, 产品规划, 产品设计, 医疗行...  20231023105635   \n",
       "4   [数据产品, 供应链产品, 电子/半导体/集成电路, 电子商务, 产品规划, 用户研究, 市...  20231016141158   \n",
       "6                                              [功率器件]  20231104101628   \n",
       "7                            [产品规划, 市场调研, 用户研究, 电子商务]  20230928114733   \n",
       "..                                                ...             ...   \n",
       "34                                             [OA产品]  20230907193745   \n",
       "35  [硬件产品, 定价/竞价产品, 策略产品, 硬件系统集成, 智能硬件, 产品规划, 产品设计...  20230823112426   \n",
       "36  [供应链产品, 数据产品, AI人工智能产品, 物流快递, 交通/运输/物流, IT互联网,...  20230427154916   \n",
       "38                           [交易产品, IT互联网, 证券业, 团队管理]  20230815140559   \n",
       "39                [支付产品, to G, to B, 消费券, 政府权益, 银行合作]  20230823102535   \n",
       "\n",
       "            job.title  job.salary  job.dq  job.topJob job.requireWorkYears  \\\n",
       "0              PM产品经理  20-40k·14薪  深圳-南山区        True                5-10年   \n",
       "2              高级产品经理      20-30k  深圳-光明区        True                5-10年   \n",
       "4        产品经理 PM（NXP）  15-20k·15薪      深圳       False                5-10年   \n",
       "6              产品经理PM  20-40k·13薪  深圳-宝安区       False                5-10年   \n",
       "7              PM产品经理      13-16k  深圳-南山区       False                5-10年   \n",
       "..                ...         ...     ...         ...                  ...   \n",
       "34       高级产品经理（OA系统）  30-45k·15薪  深圳-福田区       False                10年以上   \n",
       "35          空气净化器产品经理      12-18k  深圳-宝安区       False                 1-3年   \n",
       "36           JTY-产品经理  15-25k·14薪  深圳-南山区       False                 3-5年   \n",
       "38         交易系统服务产品经理  50-65k·18薪  深圳-福田区       False                5-10年   \n",
       "39  权益产品经理 (MJ001105)      14-23k      深圳       False                 1-3年   \n",
       "\n",
       "   job.requireEduLevel comp.compStage       comp.compName comp.compIndustry  \\\n",
       "0                 统招本科             A轮      联和存储科技(江苏)有限公司       电子/半导体/集成电路   \n",
       "2                   本科            天使轮      倍生生物科技(深圳)有限公司              生物技术   \n",
       "4                   本科            NaN                 中电港       电子/半导体/集成电路   \n",
       "6                   本科             A轮  某深圳大型电子/半导体/集成电路公司       电子/半导体/集成电路   \n",
       "7                   大专            NaN         深圳中积芯科技有限公司       电子/半导体/集成电路   \n",
       "..                 ...            ...                 ...               ...   \n",
       "34                学历不限          不需要融资        某深圳大型批发/零售公司             批发/零售   \n",
       "35                  本科             A轮           某北京电子商务公司              电子商务   \n",
       "36                  本科            NaN                顺丰科技               互联网   \n",
       "38                  本科          融资未公开               某知名公司              电子商务   \n",
       "39                  本科          创业板上市        深圳市新国都股份有限公司              智能硬件   \n",
       "\n",
       "   comp.compScale  平均薪资  \n",
       "0          50-99人  35.0  \n",
       "2        100-499人  25.0  \n",
       "4        500-999人  21.9  \n",
       "6        500-999人  32.5  \n",
       "7           1-49人  14.5  \n",
       "..            ...   ...  \n",
       "34       10000人以上  46.9  \n",
       "35       100-499人  15.0  \n",
       "36     2000-5000人  23.3  \n",
       "38       10000人以上  86.2  \n",
       "39    5000-10000人  18.5  \n",
       "\n",
       "[348 rows x 13 columns]"
      ]
     },
     "execution_count": 13,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "非薪资面议['平均薪资']=平均薪资\n",
    "非薪资面议"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 14,
   "id": "00e39cf5",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>平均薪资</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>job.requireWorkYears</th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>1-3年</th>\n",
       "      <td>15.25</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10年以上</th>\n",
       "      <td>46.90</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3-5年</th>\n",
       "      <td>21.70</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5-10年</th>\n",
       "      <td>33.80</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>一年以下</th>\n",
       "      <td>12.50</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                       平均薪资\n",
       "job.requireWorkYears       \n",
       "1-3年                  15.25\n",
       "10年以上                 46.90\n",
       "3-5年                  21.70\n",
       "5-10年                 33.80\n",
       "一年以下                  12.50"
      ]
     },
     "execution_count": 14,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# 分工作年限平均薪资\n",
    "分年限_平均薪资 = 非薪资面议.groupby('job.requireWorkYears').agg({'平均薪资':'median'})\n",
    "分年限_平均薪资"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 15,
   "id": "3d414b8a",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>平均薪资</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>job.requireWorkYears</th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>一年以下</th>\n",
       "      <td>12.50</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1-3年</th>\n",
       "      <td>15.25</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>3-5年</th>\n",
       "      <td>21.70</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5-10年</th>\n",
       "      <td>33.80</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10年以上</th>\n",
       "      <td>46.90</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                       平均薪资\n",
       "job.requireWorkYears       \n",
       "一年以下                  12.50\n",
       "1-3年                  15.25\n",
       "3-5年                  21.70\n",
       "5-10年                 33.80\n",
       "10年以上                 46.90"
      ]
     },
     "execution_count": 15,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "年限 = 分年限_平均薪资.reindex(index=[\"一年以下\",\"1-3年\",\"3-5年\",\"5-10年\",\"10年以上\"])\n",
    "年限"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 16,
   "id": "b42fa24f",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "[12.5, 15.2, 21.7, 33.8, 46.9]"
      ]
     },
     "execution_count": 16,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "分年限_平均薪资_values = [round(i[0], 1) for i in 年限.values.tolist()]\n",
    "分年限_平均薪资_values"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 17,
   "id": "d586f464",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "['一年以下', '1-3年', '3-5年', '5-10年', '10年以上']"
      ]
     },
     "execution_count": 17,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "分年限_平均薪资_index = 年限.index.tolist()\n",
    "分年限_平均薪资_index"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "0eca3a2a",
   "metadata": {},
   "source": [
    "# 词云图"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 18,
   "id": "340549fe",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "0                                       PM产品经理\n",
       "1                              产品经理 (MJ000380)\n",
       "2                                       高级产品经理\n",
       "3     Lazada-Data Product Manager-流量数据产品经理（深圳）\n",
       "4                                 产品经理 PM（NXP）\n",
       "                        ...                   \n",
       "35                                   空气净化器产品经理\n",
       "36                                    JTY-产品经理\n",
       "37                                      高级产品经理\n",
       "38                                  交易系统服务产品经理\n",
       "39                           权益产品经理 (MJ001105)\n",
       "Name: job.title, Length: 400, dtype: object"
      ]
     },
     "execution_count": 18,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    " df_PM_gz['job.title']"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 19,
   "id": "fc8edf31",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "3     Lazada-Data Product Manager-流量数据产品经理\n",
       "4                                  产品经理 PM\n",
       "14                                    产品经理\n",
       "16                                  产品线经理 \n",
       "25                                    产品经理\n",
       "                      ...                 \n",
       "6                                高级互联网产品经理\n",
       "7                                     产品经理\n",
       "18                                    产品经理\n",
       "21                                    产品经理\n",
       "34                                  高级产品经理\n",
       "Name: job.title, Length: 116, dtype: object"
      ]
     },
     "execution_count": 19,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# 还要合并回去原来的行\n",
    "df_PM_gz['job.title'][   df_PM_gz['job.title'].str.contains('（')   ].str.split('（').apply(lambda x:x[0])"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 20,
   "id": "75c812d4",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "产品经理       91\n",
       "高级产品经理     19\n",
       "PM         17\n",
       "PM产品经理     15\n",
       "产品经理PM     10\n",
       "           ..\n",
       "元器件产品经理     1\n",
       "PM 电容       1\n",
       "被动器件PM      1\n",
       "销售工程师       1\n",
       "权益产品经理      1\n",
       "Name: job.title, Length: 198, dtype: int64"
      ]
     },
     "execution_count": 20,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# 处理过一些，清洗后的数据\n",
    "df_job_title = df_PM_gz['job.title'].apply(lambda x:x.split('（')[0].split('/')[0].split('(')[0]).value_counts()\n",
    "df_job_title"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 21,
   "id": "a2da302c",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "['产品经理',\n",
       " '高级产品经理',\n",
       " 'PM',\n",
       " 'PM产品经理',\n",
       " '产品经理PM',\n",
       " '软件产品经理',\n",
       " 'pm产品经理',\n",
       " '芯片产品经理',\n",
       " 'Product Manager',\n",
       " '智能硬件产品经理',\n",
       " '产品线经理',\n",
       " '资深产品经理',\n",
       " '产品经理 ',\n",
       " 'PM产品线经理',\n",
       " '产品经理助理',\n",
       " '硬件产品经理',\n",
       " 'B端产品经理',\n",
       " 'OA产品经理',\n",
       " '菜鸟-头程产品经理',\n",
       " 'MCU产品经理',\n",
       " '商城产品经理',\n",
       " '助理产品经理',\n",
       " 'Product Manager ',\n",
       " '市场PM',\n",
       " '产品经理 PM',\n",
       " 'PM[深圳]',\n",
       " '高级人工智能',\n",
       " 'CT产品经理',\n",
       " '高级硬件产品经理',\n",
       " '无忧行app-商城产品经理',\n",
       " '券商系统高级产品经理',\n",
       " '管理会计资深产品经理',\n",
       " '0332NF-资深产品经理',\n",
       " '智能家居电工产品经理',\n",
       " '产品经理-德语',\n",
       " '消费电子类 - 产品经理',\n",
       " '云服务高级产品经理',\n",
       " '资深产品经理、产品专家',\n",
       " '跨境电商产品经理',\n",
       " '个护类产品经理',\n",
       " '产品经理-智慧停车方向',\n",
       " '产品经理-运动方向、健康方向、智慧生活方向',\n",
       " 'ipc硬件产品经理',\n",
       " '液冷产品经理',\n",
       " '家居产品经理',\n",
       " 'PMO',\n",
       " '海外支付产品经理',\n",
       " '算法产品经理',\n",
       " '仿真教育产品经理',\n",
       " '财富管理产品经理岗',\n",
       " '腾讯云高级技术服务产品经理',\n",
       " '市场产品经理',\n",
       " '钱包产品经理',\n",
       " '产品经理实习生',\n",
       " '支付产品经理',\n",
       " '高级产品经理-HR',\n",
       " '供应链金融产品经理',\n",
       " 'MCRB PM 制造项目管理',\n",
       " '游戏引擎产品经理',\n",
       " '数字财资平台产品经理',\n",
       " 'AIGC 企业级应用的产品经理',\n",
       " '临床试验项目经理PM',\n",
       " '高级IT产品经理-HR',\n",
       " '智慧文旅产品经理',\n",
       " '商旅业务产品经理',\n",
       " 'POS产品经理',\n",
       " '高级互联网产品经理',\n",
       " '储能产品经理',\n",
       " '自动化设备PM',\n",
       " '智能手表类-产品经理',\n",
       " '交易服务系统产品经理',\n",
       " '供应链领域IT产品经理',\n",
       " 'OA高级产品经理',\n",
       " '财富管理数字化产品经理岗',\n",
       " '大数据产品经理',\n",
       " '数据产品经理',\n",
       " '高级软件产品经理-操作系统',\n",
       " '助理app产品经理',\n",
       " '高级风控数据产品经理',\n",
       " '高级智能投顾产品经理',\n",
       " 'ISP产品经理',\n",
       " 'PM顾问',\n",
       " '空气净化器产品经理',\n",
       " 'JTY-产品经理',\n",
       " '交易系统服务产品经理',\n",
       " '玉湖冷链总部供应链金融产品经理',\n",
       " '产品负责人',\n",
       " 'CAD',\n",
       " '动力电池Pack 产品高级经理',\n",
       " '数字化平台产品经理',\n",
       " '智慧工地产品经理-央企旗下科技公司',\n",
       " '数字化产品经理管理岗',\n",
       " 'B端高级产品经理',\n",
       " '',\n",
       " '平台产品经理',\n",
       " '军事产品经理',\n",
       " '证券App产品经理',\n",
       " '产品经理&工程师',\n",
       " 'HR系统产品经理',\n",
       " 'SAP PM顾问',\n",
       " 'IT产品经理',\n",
       " '高级产品经理-文档方向',\n",
       " '动力电池Pack产品经理',\n",
       " '产品经理-北美市场',\n",
       " '全球医工发展服务产品经理',\n",
       " '数字化平台产品经理岗产品经理',\n",
       " '产品经理-母婴配件',\n",
       " '时空组学产品经理',\n",
       " 'Website Product Manager ',\n",
       " 'MCU PM',\n",
       " '产品经理-海外交易柜台方向',\n",
       " 'PM产品经理-Asmedia',\n",
       " 'PM产品经理-NXP',\n",
       " 'SSMR MR Product Manager 磁共振产品经理',\n",
       " 'MOS管 PM产品经理',\n",
       " 'Trade Product Manager 贸易服务部产品经理 ',\n",
       " '新能源pm产品经理',\n",
       " 'PM商品经理',\n",
       " '芯片PM',\n",
       " '通讯模组PM产品经理',\n",
       " '食品产品经理PM',\n",
       " 'MCU 产品经理PM',\n",
       " '元器件、半导体器件PM经理',\n",
       " 'PM阻容产品经理',\n",
       " 'Product Manager  ',\n",
       " '产品市场经理PM ',\n",
       " 'PM产品市场经理',\n",
       " '高级产品经理 某全球性上市奢侈品跨境电商平台',\n",
       " '华大时空国际产品经理',\n",
       " '产品经理 for ST',\n",
       " 'Product manager 产品经理',\n",
       " '外籍产品经理',\n",
       " '产品经理Product Manager',\n",
       " '海外项目工程师 Overseas PM',\n",
       " 'Lazada-Data Product Manager-流量数据产品经理',\n",
       " '产品经理 PLM PM',\n",
       " '【科技公司】BIM产品经理',\n",
       " '特效工具产品经理-智能创作  Effects Tools Product Manager-Intelligent Creation',\n",
       " 'PM 产品经理',\n",
       " '产品线经理 ',\n",
       " '产品线经理PM 交通补贴 ',\n",
       " '临床项目经理 PM',\n",
       " '代理线PM产品经理',\n",
       " '产品经理PM+电子元器件',\n",
       " '元器件产品经理PM',\n",
       " '通讯事业部-产品经理',\n",
       " '芯片产品经理PM',\n",
       " '代理商产品经理',\n",
       " '电源类产品经理  PM',\n",
       " 'IT Product Manager',\n",
       " '产品经理 Product Manager',\n",
       " '电源芯片类产品经理PM  偏销售',\n",
       " '储备PM',\n",
       " '钽电容产品经理',\n",
       " '电子元器件产品专员',\n",
       " '战略产品经理',\n",
       " 'Product Manager - Linear ',\n",
       " 'pm经理',\n",
       " '项目经理',\n",
       " '产品市场PM',\n",
       " '半导体产品经理',\n",
       " 'Product manager-APAC',\n",
       " '激光雷达PM',\n",
       " '产品线管理',\n",
       " 'TV Product Manager',\n",
       " '笔记本pm',\n",
       " 'CRM实施顾问',\n",
       " '电源IC PM',\n",
       " '传感器sensor FAE 产品经理',\n",
       " '变频器',\n",
       " 'BA',\n",
       " '项目经理pm',\n",
       " 'Senior Product Manager',\n",
       " 'Lazada-Product-Performance Marketing Product Manager',\n",
       " ' Digital Product Manager ',\n",
       " 'PM-半导体产品经理',\n",
       " 'Global Product Manager  ',\n",
       " 'Software',\n",
       " '036173-产品经理',\n",
       " '项目',\n",
       " '安世PM',\n",
       " 'APP产品经理',\n",
       " 'C端产品经理',\n",
       " '芯片代理行业产品经理',\n",
       " '解决方案产品经理',\n",
       " '跨境物流产品经理',\n",
       " '0361N2-产品经理',\n",
       " '日语产品经理',\n",
       " 'Sr. Product Manager - Device ',\n",
       " '国际化产品研发中心',\n",
       " '0351WC-产品经理',\n",
       " 'Lazada-Product Manager-Performance marketing staff product manager-深圳',\n",
       " 'FPGA产品经理',\n",
       " '元器件产品经理',\n",
       " 'PM 电容',\n",
       " '被动器件PM',\n",
       " '销售工程师',\n",
       " '权益产品经理 ']"
      ]
     },
     "execution_count": 21,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "df_job_title.index.tolist()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 22,
   "id": "81680d4c",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "[('高级产品经理', 19),\n",
       " ('PM', 17),\n",
       " ('PM产品经理', 15),\n",
       " ('产品经理PM', 10),\n",
       " ('软件产品经理', 7),\n",
       " ('pm产品经理', 7),\n",
       " ('芯片产品经理', 6),\n",
       " ('Product Manager', 6),\n",
       " ('智能硬件产品经理', 5),\n",
       " ('产品线经理', 5),\n",
       " ('资深产品经理', 5),\n",
       " ('产品经理 ', 4),\n",
       " ('PM产品线经理', 3),\n",
       " ('产品经理助理', 3),\n",
       " ('硬件产品经理', 3),\n",
       " ('B端产品经理', 3),\n",
       " ('OA产品经理', 3),\n",
       " ('菜鸟-头程产品经理', 2),\n",
       " ('MCU产品经理', 2),\n",
       " ('商城产品经理', 2),\n",
       " ('助理产品经理', 2),\n",
       " ('Product Manager ', 2),\n",
       " ('市场PM', 2),\n",
       " ('产品经理 PM', 2),\n",
       " ('PM[深圳]', 2),\n",
       " ('高级人工智能', 1),\n",
       " ('CT产品经理', 1),\n",
       " ('高级硬件产品经理', 1),\n",
       " ('无忧行app-商城产品经理', 1),\n",
       " ('券商系统高级产品经理', 1),\n",
       " ('管理会计资深产品经理', 1),\n",
       " ('0332NF-资深产品经理', 1),\n",
       " ('智能家居电工产品经理', 1),\n",
       " ('产品经理-德语', 1),\n",
       " ('消费电子类 - 产品经理', 1),\n",
       " ('云服务高级产品经理', 1),\n",
       " ('资深产品经理、产品专家', 1),\n",
       " ('跨境电商产品经理', 1),\n",
       " ('个护类产品经理', 1),\n",
       " ('产品经理-智慧停车方向', 1),\n",
       " ('产品经理-运动方向、健康方向、智慧生活方向', 1),\n",
       " ('ipc硬件产品经理', 1),\n",
       " ('液冷产品经理', 1),\n",
       " ('家居产品经理', 1),\n",
       " ('PMO', 1),\n",
       " ('海外支付产品经理', 1),\n",
       " ('算法产品经理', 1),\n",
       " ('仿真教育产品经理', 1),\n",
       " ('财富管理产品经理岗', 1),\n",
       " ('腾讯云高级技术服务产品经理', 1),\n",
       " ('市场产品经理', 1),\n",
       " ('钱包产品经理', 1),\n",
       " ('产品经理实习生', 1),\n",
       " ('支付产品经理', 1),\n",
       " ('高级产品经理-HR', 1),\n",
       " ('供应链金融产品经理', 1),\n",
       " ('MCRB PM 制造项目管理', 1),\n",
       " ('游戏引擎产品经理', 1),\n",
       " ('数字财资平台产品经理', 1),\n",
       " ('AIGC 企业级应用的产品经理', 1),\n",
       " ('临床试验项目经理PM', 1),\n",
       " ('高级IT产品经理-HR', 1),\n",
       " ('智慧文旅产品经理', 1),\n",
       " ('商旅业务产品经理', 1),\n",
       " ('POS产品经理', 1),\n",
       " ('高级互联网产品经理', 1),\n",
       " ('储能产品经理', 1),\n",
       " ('自动化设备PM', 1),\n",
       " ('智能手表类-产品经理', 1),\n",
       " ('交易服务系统产品经理', 1),\n",
       " ('供应链领域IT产品经理', 1),\n",
       " ('OA高级产品经理', 1),\n",
       " ('财富管理数字化产品经理岗', 1),\n",
       " ('大数据产品经理', 1),\n",
       " ('数据产品经理', 1),\n",
       " ('高级软件产品经理-操作系统', 1),\n",
       " ('助理app产品经理', 1),\n",
       " ('高级风控数据产品经理', 1),\n",
       " ('高级智能投顾产品经理', 1),\n",
       " ('ISP产品经理', 1),\n",
       " ('PM顾问', 1),\n",
       " ('空气净化器产品经理', 1),\n",
       " ('JTY-产品经理', 1),\n",
       " ('交易系统服务产品经理', 1),\n",
       " ('玉湖冷链总部供应链金融产品经理', 1),\n",
       " ('产品负责人', 1),\n",
       " ('CAD', 1),\n",
       " ('动力电池Pack 产品高级经理', 1),\n",
       " ('数字化平台产品经理', 1),\n",
       " ('智慧工地产品经理-央企旗下科技公司', 1),\n",
       " ('数字化产品经理管理岗', 1),\n",
       " ('B端高级产品经理', 1),\n",
       " ('', 1),\n",
       " ('平台产品经理', 1),\n",
       " ('军事产品经理', 1),\n",
       " ('证券App产品经理', 1),\n",
       " ('产品经理&工程师', 1),\n",
       " ('HR系统产品经理', 1),\n",
       " ('SAP PM顾问', 1),\n",
       " ('IT产品经理', 1),\n",
       " ('高级产品经理-文档方向', 1),\n",
       " ('动力电池Pack产品经理', 1),\n",
       " ('产品经理-北美市场', 1),\n",
       " ('全球医工发展服务产品经理', 1),\n",
       " ('数字化平台产品经理岗产品经理', 1),\n",
       " ('产品经理-母婴配件', 1),\n",
       " ('时空组学产品经理', 1),\n",
       " ('Website Product Manager ', 1),\n",
       " ('MCU PM', 1),\n",
       " ('产品经理-海外交易柜台方向', 1),\n",
       " ('PM产品经理-Asmedia', 1),\n",
       " ('PM产品经理-NXP', 1),\n",
       " ('SSMR MR Product Manager 磁共振产品经理', 1),\n",
       " ('MOS管 PM产品经理', 1),\n",
       " ('Trade Product Manager 贸易服务部产品经理 ', 1),\n",
       " ('新能源pm产品经理', 1),\n",
       " ('PM商品经理', 1),\n",
       " ('芯片PM', 1),\n",
       " ('通讯模组PM产品经理', 1),\n",
       " ('食品产品经理PM', 1),\n",
       " ('MCU 产品经理PM', 1),\n",
       " ('元器件、半导体器件PM经理', 1),\n",
       " ('PM阻容产品经理', 1),\n",
       " ('Product Manager  ', 1),\n",
       " ('产品市场经理PM ', 1),\n",
       " ('PM产品市场经理', 1),\n",
       " ('高级产品经理 某全球性上市奢侈品跨境电商平台', 1),\n",
       " ('华大时空国际产品经理', 1),\n",
       " ('产品经理 for ST', 1),\n",
       " ('Product manager 产品经理', 1),\n",
       " ('外籍产品经理', 1),\n",
       " ('产品经理Product Manager', 1),\n",
       " ('海外项目工程师 Overseas PM', 1),\n",
       " ('Lazada-Data Product Manager-流量数据产品经理', 1),\n",
       " ('产品经理 PLM PM', 1),\n",
       " ('【科技公司】BIM产品经理', 1),\n",
       " ('特效工具产品经理-智能创作  Effects Tools Product Manager-Intelligent Creation', 1),\n",
       " ('PM 产品经理', 1),\n",
       " ('产品线经理 ', 1),\n",
       " ('产品线经理PM 交通补贴 ', 1),\n",
       " ('临床项目经理 PM', 1),\n",
       " ('代理线PM产品经理', 1),\n",
       " ('产品经理PM+电子元器件', 1),\n",
       " ('元器件产品经理PM', 1),\n",
       " ('通讯事业部-产品经理', 1),\n",
       " ('芯片产品经理PM', 1),\n",
       " ('代理商产品经理', 1),\n",
       " ('电源类产品经理  PM', 1),\n",
       " ('IT Product Manager', 1),\n",
       " ('产品经理 Product Manager', 1),\n",
       " ('电源芯片类产品经理PM  偏销售', 1),\n",
       " ('储备PM', 1),\n",
       " ('钽电容产品经理', 1),\n",
       " ('电子元器件产品专员', 1),\n",
       " ('战略产品经理', 1),\n",
       " ('Product Manager - Linear ', 1),\n",
       " ('pm经理', 1),\n",
       " ('项目经理', 1),\n",
       " ('产品市场PM', 1),\n",
       " ('半导体产品经理', 1),\n",
       " ('Product manager-APAC', 1),\n",
       " ('激光雷达PM', 1),\n",
       " ('产品线管理', 1),\n",
       " ('TV Product Manager', 1),\n",
       " ('笔记本pm', 1),\n",
       " ('CRM实施顾问', 1),\n",
       " ('电源IC PM', 1),\n",
       " ('传感器sensor FAE 产品经理', 1),\n",
       " ('变频器', 1),\n",
       " ('BA', 1),\n",
       " ('项目经理pm', 1),\n",
       " ('Senior Product Manager', 1),\n",
       " ('Lazada-Product-Performance Marketing Product Manager', 1),\n",
       " (' Digital Product Manager ', 1),\n",
       " ('PM-半导体产品经理', 1),\n",
       " ('Global Product Manager  ', 1),\n",
       " ('Software', 1),\n",
       " ('036173-产品经理', 1),\n",
       " ('项目', 1),\n",
       " ('安世PM', 1),\n",
       " ('APP产品经理', 1),\n",
       " ('C端产品经理', 1),\n",
       " ('芯片代理行业产品经理', 1),\n",
       " ('解决方案产品经理', 1),\n",
       " ('跨境物流产品经理', 1),\n",
       " ('0361N2-产品经理', 1),\n",
       " ('日语产品经理', 1),\n",
       " ('Sr. Product Manager - Device ', 1),\n",
       " ('国际化产品研发中心', 1),\n",
       " ('0351WC-产品经理', 1),\n",
       " ('Lazada-Product Manager-Performance marketing staff product manager-深圳', 1),\n",
       " ('FPGA产品经理', 1),\n",
       " ('元器件产品经理', 1),\n",
       " ('PM 电容', 1),\n",
       " ('被动器件PM', 1),\n",
       " ('销售工程师', 1),\n",
       " ('权益产品经理 ', 1)]"
      ]
     },
     "execution_count": 22,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# 列表推导式\n",
    "PM_title_words = [(  df_job_title.index.tolist()[i]   ,   df_job_title.values.tolist()[i]  )    for i in range(1,len(df_job_title.index.tolist())) ]\n",
    "PM_title_words"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "5fdc7416",
   "metadata": {},
   "source": [
    "# 相关岗位的平均薪资"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 23,
   "id": "f00bc79a",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>平均薪资</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>job.title</th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>PM产品经理</th>\n",
       "      <td>15.85</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>产品经理</th>\n",
       "      <td>25.00</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>产品经理PM</th>\n",
       "      <td>24.80</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>软件产品经理</th>\n",
       "      <td>31.35</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>高级产品经理</th>\n",
       "      <td>42.80</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "            平均薪资\n",
       "job.title       \n",
       "PM产品经理     15.85\n",
       "产品经理       25.00\n",
       "产品经理PM     24.80\n",
       "软件产品经理     31.35\n",
       "高级产品经理     42.80"
      ]
     },
     "execution_count": 23,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# 先统计每个job.title的数量\n",
    "job_title_counts = 非薪资面议['job.title'].value_counts()\n",
    "\n",
    "# 获取数量前五的job.title\n",
    "top_5_job_titles = job_title_counts.head(5).index.fillna('')\n",
    "\n",
    "# 根据数量前五的job.title筛选数据\n",
    "分岗位_平均薪资 = 非薪资面议[非薪资面议['job.title'].isin(top_5_job_titles)].groupby('job.title').agg({'平均薪资':'median'})\n",
    "\n",
    "分岗位_平均薪资\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 24,
   "id": "0806497b",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "[15.8, 25.0, 24.8, 31.4, 42.8]"
      ]
     },
     "execution_count": 24,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "分岗位_平均薪资_values = [round(i[0], 1) for i in 分岗位_平均薪资.values.tolist()]\n",
    "分岗位_平均薪资_values\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 25,
   "id": "d6f09ed2",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "['PM产品经理', '产品经理', '产品经理PM', '软件产品经理', '高级产品经理']"
      ]
     },
     "execution_count": 25,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "分岗位_平均薪资_index = 分岗位_平均薪资.index.tolist()\n",
    "分岗位_平均薪资_index"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "46534f18",
   "metadata": {},
   "source": [
    "# 饼图"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 26,
   "id": "058a213d",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>平均薪资</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>comp.compScale</th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <th>1-49人</th>\n",
       "      <td>17.5</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>100-499人</th>\n",
       "      <td>24.4</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>1000-2000人</th>\n",
       "      <td>27.8</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>10000人以上</th>\n",
       "      <td>37.5</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>2000-5000人</th>\n",
       "      <td>27.5</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>50-99人</th>\n",
       "      <td>20.0</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>500-999人</th>\n",
       "      <td>23.7</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>5000-10000人</th>\n",
       "      <td>29.6</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "</div>"
      ],
      "text/plain": [
       "                平均薪资\n",
       "comp.compScale      \n",
       "1-49人           17.5\n",
       "100-499人        24.4\n",
       "1000-2000人      27.8\n",
       "10000人以上        37.5\n",
       "2000-5000人      27.5\n",
       "50-99人          20.0\n",
       "500-999人        23.7\n",
       "5000-10000人     29.6"
      ]
     },
     "execution_count": 26,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# 首先去除'comp.compScale'中没有文字的行\n",
    "非薪资面议 = 非薪资面议[非薪资面议['comp.compScale'].str.strip() != '']\n",
    "\n",
    "\n",
    "# 先统计每个job.title的数量\n",
    "job_title_counts = 非薪资面议['comp.compScale'].value_counts()\n",
    "\n",
    "# 获取数量前五的job.title\n",
    "top_5_job_titles = job_title_counts.head(9).index\n",
    "\n",
    "# 根据数量前五的job.title筛选数据\n",
    "公司规模 = 非薪资面议[非薪资面议['comp.compScale'].isin(top_5_job_titles)].groupby('comp.compScale').agg({'平均薪资':'median'})\n",
    "公司规模\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 27,
   "id": "53e69832",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "[17.5, 24.4, 27.8, 37.5, 27.5, 20.0, 23.7, 29.6]"
      ]
     },
     "execution_count": 27,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "公司规模_薪资 = [round(i[0], 1) for i in 公司规模.values.tolist()]\n",
    "公司规模_薪资\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 28,
   "id": "f9f1390a",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "['1-49人',\n",
       " '100-499人',\n",
       " '1000-2000人',\n",
       " '10000人以上',\n",
       " '2000-5000人',\n",
       " '50-99人',\n",
       " '500-999人',\n",
       " '5000-10000人']"
      ]
     },
     "execution_count": 28,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "公司规模_人数 = 公司规模.index.tolist()\n",
    "公司规模_人数"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 29,
   "id": "ce3537bb",
   "metadata": {},
   "outputs": [],
   "source": [
    "maps = (\n",
    "    Map()\n",
    "    .add(\"岗位数量\", [list(z) for z in zip(地区, 岗位个数)],城市)\n",
    "    .set_global_opts(\n",
    "        title_opts=opts.TitleOpts(title=城市+关键词+\"的分布地图\"), visualmap_opts=opts.VisualMapOpts()\n",
    "    )\n",
    "    #.render( 城市+关键词+\"地区分布\"+\".html\")\n",
    ")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 30,
   "id": "53d12091",
   "metadata": {},
   "outputs": [],
   "source": [
    "import pyecharts.options as opts\n",
    "from pyecharts.charts import Line\n",
    "from pyecharts.commons.utils import JsCode\n",
    "\n",
    "\n",
    "\n",
    "background_color_js = (\n",
    "    \"new echarts.graphic.LinearGradient(0, 0, 0, 1, \"\n",
    "    \"[{offset: 0, color: '#c86589'}, {offset: 1, color: '#06a7ff'}], false)\"\n",
    ")\n",
    "area_color_js = (\n",
    "    \"new echarts.graphic.LinearGradient(0, 0, 0, 1, \"\n",
    "    \"[{offset: 0, color: '#eb64fb'}, {offset: 1, color: '#3fbbff0d'}], false)\"\n",
    ")\n",
    "\n",
    "line = (\n",
    "    Line(init_opts=opts.InitOpts(bg_color=JsCode(background_color_js)))\n",
    "    .add_xaxis(xaxis_data=分年限_平均薪资_index)  # 设置x轴数据为分年限_平均薪资_index\n",
    "    .add_yaxis(\n",
    "        series_name=\"薪资\",\n",
    "        y_axis=分年限_平均薪资_values,  # 设置y轴数据为分年限_平均薪资_values\n",
    "        is_smooth=True,\n",
    "        is_symbol_show=True,\n",
    "        symbol=\"circle\",\n",
    "        symbol_size=6,\n",
    "        linestyle_opts=opts.LineStyleOpts(color=\"#fff\"),\n",
    "        label_opts=opts.LabelOpts(is_show=True, position=\"top\", color=\"white\"),\n",
    "        itemstyle_opts=opts.ItemStyleOpts(\n",
    "            color=\"red\", border_color=\"#fff\", border_width=3\n",
    "        ),\n",
    "        tooltip_opts=opts.TooltipOpts(is_show=False),\n",
    "        areastyle_opts=opts.AreaStyleOpts(color=JsCode(area_color_js), opacity=1),\n",
    "    )\n",
    "    .set_global_opts(\n",
    "        title_opts=opts.TitleOpts(\n",
    "            title=城市+关键词+\"从业年限平均薪资\",\n",
    "            pos_left=\"5%\",  # 将标题移到左上角\n",
    "            pos_top=\"5%\",  # 将标题移到左上角\n",
    "            title_textstyle_opts=opts.TextStyleOpts(color=\"#fff\", font_size=16),\n",
    "        ),\n",
    "        xaxis_opts=opts.AxisOpts(\n",
    "            type_=\"category\",\n",
    "            boundary_gap=False,\n",
    "            axislabel_opts=opts.LabelOpts(margin=30, color=\"#ffffff63\"),\n",
    "            axisline_opts=opts.AxisLineOpts(is_show=False),\n",
    "            axistick_opts=opts.AxisTickOpts(\n",
    "                is_show=True,\n",
    "                length=25,\n",
    "                linestyle_opts=opts.LineStyleOpts(color=\"#ffffff1f\"),\n",
    "            ),\n",
    "            splitline_opts=opts.SplitLineOpts(\n",
    "                is_show=True, linestyle_opts=opts.LineStyleOpts(color=\"#ffffff1f\")\n",
    "            ),\n",
    "        ),\n",
    "        yaxis_opts=opts.AxisOpts(\n",
    "            type_=\"value\",\n",
    "            position=\"right\",\n",
    "            axislabel_opts=opts.LabelOpts(margin=20, color=\"#ffffff63\"),\n",
    "            axisline_opts=opts.AxisLineOpts(\n",
    "                linestyle_opts=opts.LineStyleOpts(width=2, color=\"#fff\")\n",
    "            ),\n",
    "            axistick_opts=opts.AxisTickOpts(\n",
    "                is_show=True,\n",
    "                length=15,\n",
    "                linestyle_opts=opts.LineStyleOpts(color=\"#ffffff1f\"),\n",
    "            ),\n",
    "            splitline_opts=opts.SplitLineOpts(\n",
    "                is_show=True, linestyle_opts=opts.LineStyleOpts(color=\"#ffffff1f\")\n",
    "            ),\n",
    "        ),\n",
    "        legend_opts=opts.LegendOpts(is_show=False),\n",
    "    )\n",
    "    #.render(\"从业年限平均薪资.html\")\n",
    "\n",
    ")\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 31,
   "id": "928eb2ff",
   "metadata": {},
   "outputs": [],
   "source": [
    "from pyecharts import options as opts\n",
    "from pyecharts.charts import WordCloud,Grid\n",
    "from pyecharts.globals import SymbolType\n",
    "\n",
    "worlds = (\n",
    "    WordCloud()\n",
    "    .add(\"\", PM_title_words, word_size_range=[20, 100], shape=SymbolType.DIAMOND)\n",
    "    .set_global_opts(title_opts=opts.TitleOpts(title=\"岗位名称词云图\"))\n",
    "    #.render( 关键词 +\"_wordcloud_map_岗位名称_\"+\".html\")\n",
    ")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 32,
   "id": "28a968c9",
   "metadata": {},
   "outputs": [],
   "source": [
    "from pyecharts import options as opts\n",
    "from pyecharts.charts import Bar\n",
    "from pyecharts.commons.utils import JsCode\n",
    "from pyecharts.faker import Faker\n",
    "\n",
    "background_color_js = (\n",
    "    \"new echarts.graphic.LinearGradient(0, 0, 0, 1, \"\n",
    "    \"[{offset: 0, color: '#c86589'}, {offset: 1, color: '#06a7ff'}], false)\"\n",
    ")\n",
    "\n",
    "bar = (\n",
    "    Bar()\n",
    "    .add_xaxis(分岗位_平均薪资_index)\n",
    "    .add_yaxis(关键词, 分岗位_平均薪资_values, category_gap=\"70%\")\n",
    "    .set_series_opts(\n",
    "        itemstyle_opts={\n",
    "            \"normal\": {\n",
    "                \"color\": JsCode(background_color_js),\n",
    "                \"barBorderRadius\": [30, 30, 30, 30],\n",
    "                \"shadowColor\": \"rgb(0, 160, 221)\",\n",
    "            }\n",
    "        }\n",
    "    )\n",
    "    .set_global_opts(title_opts=opts.TitleOpts(title=\"相关岗位平均薪资\"),\n",
    "                    xaxis_opts=opts.AxisOpts(axislabel_opts=opts.LabelOpts(rotate=-15))\n",
    "                    )\n",
    "    #.render(\"相关岗位平均薪资.html\")\n",
    ")\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 33,
   "id": "ce06fe4e",
   "metadata": {},
   "outputs": [],
   "source": [
    "import pyecharts.options as opts\n",
    "from pyecharts.charts import Pie, Tab\n",
    "\n",
    "\"\"\"\n",
    "Gallery 使用 pyecharts 1.1.0\n",
    "参考地址: https://echarts.apache.org/examples/editor.html?c=pie-doughnut\n",
    "\n",
    "目前无法实现的功能:\n",
    "\n",
    "1、迷之颜色映射的问题\n",
    "\"\"\"\n",
    "\n",
    "x_data = 公司规模_人数\n",
    "y_data = 公司规模_薪资\n",
    "data_pair = [list(z) for z in zip(x_data, y_data)]\n",
    "data_pair.sort(key=lambda x: x[1])\n",
    "\n",
    "pies=(\n",
    "    Pie(init_opts=opts.InitOpts(bg_color=\"#2c343c\"))\n",
    "    .add(\n",
    "        series_name=\"薪资\",\n",
    "        data_pair=data_pair,\n",
    "        rosetype=\"radius\",\n",
    "        radius=\"55%\",\n",
    "        center=[\"50%\", \"50%\"],\n",
    "        label_opts=opts.LabelOpts(is_show=False, position=\"center\"),\n",
    "    )\n",
    "    .set_global_opts(\n",
    "        title_opts=opts.TitleOpts(\n",
    "            title=城市+关键词+\"的\"+\"各公司规模薪资\",\n",
    "            pos_left=\"center\",\n",
    "            pos_top=\"20\",\n",
    "            title_textstyle_opts=opts.TextStyleOpts(color=\"#fff\"),\n",
    "        ),\n",
    "        legend_opts=opts.LegendOpts(is_show=False),\n",
    "    )\n",
    "    .set_series_opts(\n",
    "        tooltip_opts=opts.TooltipOpts(\n",
    "            trigger=\"item\", formatter=\"{a} <br/>{b}: {c} ({d}%)\"\n",
    "        ),\n",
    "        label_opts=opts.LabelOpts(color=\"rgba(255, 255, 255, 0.3)\"),\n",
    "    )\n",
    "    #.render(\"饼图.html\")\n",
    ")\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 37,
   "id": "573a8f74",
   "metadata": {},
   "outputs": [],
   "source": [
    "# 创建一个 Page 对象\n",
    "page = Page(layout=Page.SimplePageLayout)\n",
    "\n",
    "# 添加图表到 Page 对象，实现横向排列两个、换行后再横向排列两个\n",
    "page.add(bar, line, pies, worlds)\n",
    "\n",
    "# 设置竖向排列的单个图表\n",
    "page.add(maps)\n",
    "\n",
    "# 渲染页面\n",
    "page.render()\n",
    "\n",
    "\n",
    "\n",
    "\n",
    "\n",
    "\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "64e68434",
   "metadata": {},
   "outputs": [],
   "source": []
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.9.12"
  },
  "toc": {
   "base_numbering": 1,
   "nav_menu": {},
   "number_sections": true,
   "sideBar": true,
   "skip_h1_title": false,
   "title_cell": "Table of Contents",
   "title_sidebar": "Contents",
   "toc_cell": false,
   "toc_position": {
    "height": "calc(100% - 180px)",
    "left": "10px",
    "top": "150px",
    "width": "240.667px"
   },
   "toc_section_display": true,
   "toc_window_display": true
  }
 },
 "nbformat": 4,
 "nbformat_minor": 5
}
